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FOREWORD 


This volume contains 12 papers prepared by agencies working 
in trajectory analysis and guidance theory with the Computer 
Research Laboratory of the NASA Electronics Research Center. 

The papers are concerned with special studies performed in 
guidance theory, optimization theory, numerical methods, and 
celestial mechanics. They include: 

1. An extension of the classical theory of calculus of 
variations to include varying number and types of 
cons t r a in t s ; 

2. A development of theory for relaxed controls for 
integral equations; 

3. A generalization of the above case to one where the 
class of controls may, but need not, consist of 
relaxed controls; 

4. An application of Ham i 1 t on- J acob i theory to a planar 
trajectory optimization problem; 

5. A presentation of a method of obtaining a complete 
integral of the H ami 1 t on- Jac ob i equation associated with 
a dynamical system in which constants of motion are 
known ; 

6. A method of solving two-point boundary-value problems 
by an offset vector iteration method; 

7. A linearized guidance procedure based on minimum 
impulses for space trajectories; 

8. A set of equations for computing orbits in closed form 
using the spheroidal method of calculation; in particular, 
they are good for polar and near-polar orbits; 

9. A procedure for developing expansion formulas in 
canonical transformation in which the form is developed 
for speedy computerized symbolic manipulation; 

10. A formal solution of the n-body problem in Taylor 
series ; 

11. A paper on the long period behavior of a close lunar 
o r b i t e r ; 


FOREWORD 


12. A presentation of non-linear resonance theory with 
an application. 

These papers cover work performed from 1 October 1967 to 
1 February 1969. This work was supervised by personnel of the 
Computer Research Laboratory. 


SUMMARY 


This volume contains technical papers on NASA- 
sponsored studies in the areas of trajectory analysis 
and guidance theory. These papers cover the period 
beginning 1 October 1967 and ending 1 October 1968. 

The technical supervision of this work is under the 
personnel of the Computer Research Laboratory at NASA-ERC. 
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INTRODUCTION 

By William E. Miner 
Chief, Computation Theory 
and Techniques Branch 
NASA Electronics Research Center 


This document contains 12 technical papers covering work 
sponsored by the Computer Research Laboratory of the NASA 
Electronics Research Center in the fields of guidance theory, 
optimization theory, numerical methods, and celestial mechanics 

The following table lists the authors, contributing 
institutions, and the disciplines of each paper. 
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Arkansas State Univ. 
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Univ. of So. Calif. 


Optimization Theory 
Optimization Theory 
Optimization Theory 
Optimization Theory 

Numerical Methods 
Guidance Theory 
Celestial Mechanics 
Celestial Mechanics 
Celestial Mechanics 

Celestial Mechanics 
Celestial Mechanics 


The above characterization is made only in a general way. 
Work done in optimization theory may have application in 
trajectory analysis, control theory, guidance theory, and/or 
celestial mechanics. Work done in celestial mechanics often 
overlaps into the area of optimization theory with potential 
applications to that theory. Numerical methods find usages 
in many disciplines. 



INTRODUCTION 


Synopses of the individual papers are presented below: 

Paper No. 1 

The first paper, written by J. L. Linnstaedter of Arkansas 
State University, presents a generalized, multistage problem of 
Bolza in the calculus of variations. The differential con- 
straints and the number of differential constraints may be 
different for each of the various stages. The stages are allowed 
to degenerate. Discontinuities at staging points are permitted. 
The paper presents a multiplier rule and analogues of the 
Weierstrass and Clebsch conditions. 

Paper No , 2 


The second paper, written by J. Warga of Northeastern 
University, covers relaxed controls for functional equations 
where the functional values of the state are considered as 
known functions of states and controls. These equations are 
constrained by known functions of the states and controls, and a 
function of the controls and states is minimized. The controls 
are embedded in a set of "relaxed controls" so that the exist- 
ence of a relaxed minimizing point and an approximate solution 
may be obtained under mild assumptions. Theorem 2.1 presents 
the results described above. The proof is presented in 
paragraph 5. The paper presents theorems based on the special 
case of a control problem defined by a Uryson-type integral 
equation . 

Paper No. 3 

The third paper, also written by J. Warga of Northeastern 
University, is a generalization of the second paper in this 
compilation. The existence of an original (unrelaxed) control 
is assumed. It is shown that the generalizations of the 
Weierstrass E-condition and the t ransve rsal i ty conditions 
presented in the second paper remain essentially valid for an 
original control. The generalization is in the sense that the 
class of controls may, but need not, consist of relaxed controls. 
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Paper No . 5 

The fifth paper, written by P. M. Fitzpatrick and 
J. E . Cochran of Auburn University, covers the use of Liouville's 
theorem for deriving a generating function for transformations 
of a Hamiltonian system. Methods are developed for making 
transformations which make use of the known constants of 
integration by putting the variables and constants in the form 
so that Liouville's theorem may be applied. The methods are 
then applied to two examples. The examples are the orbit in 
the central force field and free motion of a triaxial rigid 
body. 

Pape r No . 6 

The sixth paper, written by C. F. Price of Massachusetts 
Institute of Technology, presents an offset vector iteration 
method for solving two-point, boundary- value problems along 
with a modification. The method depends on an "approximate 
solution". It has the distinct advantage of moving toward 
the desired solution with each pass through the ordinary 
differential equations of motion and, therefore, if the 
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Paper No , 7 


The seventh paper, written by T. N. Edelbaum of Analytical 
Mechanics Associates, Inc., presents a linearized guidance 
procedure for a space trajectory. The space trajectory is a 
minimum-fuel trajectory and the thrusting is impulsive. The 
guidance corrections are impulsive and are designed to be 
valid in the neighborhood of the nominal trajectory. This 
paper covers three different problems; (1) the time-open 
rendezvous case, (2) the time-open orbit transfer, and 
(3) the time -open orbit transfer where one or more finite 
impulses are tangent to the velocity vector. 

Paper No. 8 

The eighth paper written by J. P. Vinti of Massachusetts 
Institute of Technology presents a set of equations for 
computing orbits in closed form using the spheroidal method 
(Vinti potential) of calculation. The equations are good in 
the general case and in particular they are good for polar 
and near polar orbits. The paper develops the changes in the 
known equations so that near the polar orbits division by 
differences of near equal quantities (near zero) is avoided. 
Thus, the numerical accuracy is enhanced. The procedure trans- 
forms the equations so that an explicit parameter for the 
right ascension does not appear. This is the troublesome 
var iab le . 

Paper No. 9 

The ninth paper, written by A. A. Kamel of Stanford 
University, presents procedures for developing expansion 
formulae in canonical transformations depending on a small 
parameter where the implementation of such perturbation theory 
is put in a form for speedy computerized symbolic manipulation. 
"Deprits 1 equations" are developed using a linear operator 
called Lie derivative generated by W, where the 

generating function W has a special form. Recursive relation- 
ships of the transformed variables and Hamiltonian are then 
developed. These recursive relationships are then modified 
by the introduction of intermediate functions to increase the 
speed of computerized symbolic manipulations. 
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Paper No . 10 

The tenth paper, written by P. Sconzo and D. Valenzuela 
of International Business Machines’ Cambridge Advanced Space 
Systems Department, presents a formal solution of the n-body 
problem. This solution is a Taylor series in time for each of 
the 3n variables with coefficients generated recursively from 
the 6n initial conditions. It is obtained by a careful selection 
of intermediate variables and by the use of PL/1 FORMAC. 

Paper No. 11 


The eleventh paper, written by R. Dasenbrock of 
Stanford University, is on the long period behavior of a close 
lunar orbiter. A reference frame is chosen which is rotating 
with the moon with the x-axis in the equatorial plane determined 
by Cassini’s law and the z-axis along the axis of rotation of 
the moon. The Hamiltonian is written in this rotating system in 
mixed Keplerian and Delaunay variables. The parts of the 
Hamiltonian are then ordered and integration of the equations 
is obtained with the short period terms averaged out by a 
series of canonical transformations. It is pointed out that 
there are 11 critical inclinations. Near these inclinations 
the von Zeipel method, which was used, fails. The case of the 
polar orbit is discussed separately. Phase plane contours of 
H "and h" with constant F** are presented and discussed. This 
work is a continuation of earlier work done by J. Vagners 
documented in NASA-ERC PM-67-21, pp . 213-228. 

Paper No, 12 

The twelth paper, written by W. T, Kyner of the University 
of Southern California, contains an exposition of the theory of 
non-linear resonance followed by application to the 
perturbations on the orbit of a 24-hour synchronous satellite. 

The expository portion is based on lectures delivered at the 
1968 Summer Institute of Dyn ami cal As t ronomy . In the application, 
it is shown that the longitude on a synchronous satellite 
satisfies a pendulum equation on the average. The validity of 
the pendulum mo del is restricted to time intervals of the 
order of 1 / y/ ^ 2 2* 
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INTRODUCTION 


Two internal publications authored or co-authored by 
members of the sponsoring laboratory and in the subject 
technical fields have appeared since the last compilation. 

These are listed below with their summaries. 

Miner, William E . : The Equations of Motion for 

Optimized Propelled Flight Expressed in Delaunay 

and Poincare Variables and Modifications of These 

Variables. NASA TN D-4478, May 1968. 

SUMMARY 

This document presents methods for developing the ordinary 
differential equations (o.d.e.) of motion in canonical form 
equivalent to the forms of Delaunay and Poincare. It also 
presents modifications to these forms so that three variables, 
which are constants of motion, result while the forms remain 
canonical . 

The equations of motion are for a vehicle propelled by 
constant thrust magnitude with a constant mass flow rate. 

The vehicle is moving in a central force field. The trajectories 
are optimum in the sense of classical calculus of variations in 
a neighborhood definable by the boundary conditions of the 
specific problem. Specific problems are not discussed in 
this do cumen t . 

The value of the document lies in two major areas: 

1. The possible economics in numerical calculations which 
may result from using these ordinary differential 
equations, and 

2. The application of the general perturbation theory of 
classical celestial mechanics to approximate solutions 
of these ordinary differential equations. 

This document has been written to record the results of the 
investigation and was not meant to be a tutorial treatment of 
the subject. For such treatment, the references listed below 
are recommended by the author: 

1. Bliss, G. A.: Lectures on the Calculus of Variations. 

University of Chicago Press, Chicago, 111., 1961. 

2. Goldstein, H.: Classical Mechanics . Addison-Wes ley 

Publishing Co., Inc., Cambridge, Mass., March 1956. 
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3. Ford, L. R.: Differential Equations . McGraw-Hill Book 

Co. , Inc. , N. Y., 1933. 

4. Smart, W. M.: Celestial Mechanics . Longmans, Green, 

and Co., Ltd., London, 1953. 

******* 

Hoelker, R. F., and Winston, B. P.: A Comparison of a 

Class of Earth-Moon Orbits with a Class of Rotating 

Kepler Orbits. NASA TN D-4903, November 1968. 

SUMMARY 

In two concurrent series of graphs, a class of orbits in 
the Earth-Moon (E-M) field and a class of Kepler orbits in 
rotating coordinates are depicted and compared. 

A general discussion of characteristics of rotating Kepler 
orbits is included. 

The model used for the E-M orbits is that of the restricted 
problem of threebodies. The orbits of the class depicted 
originate at the E-M line, half of the E-M distance beyond the 
moon with velocity orthogonal to the E-M line within the E-M 
plane. 
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A GENERALIZED MULTISTAGE PROBLEM OF 
BOLZA IN THE CALCULUS OF VARIATIONS, I 


By J. L. Linnstaedter 
Associate Professor of Mathematics 
Arkansas State University 
State University, Arkansas 
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A GENERALIZED MULTISTAGE PROBLEM OF 
BOLZA IN THE CALCULUS OF VARIATIONS, I * 

By J. L. Linns taedter 
Associate Professor of Mathematics 
Arkansas State University 
State University, Arkansas 

SUMMARY 

The problem is to find in a class of admissible arcs, satisfying 
certain multistage differential equations of constraint and end and 
intermediate point constraints, one which minimizes a Bolza type 
expression. The differential constraints may be different and different 
in number on the separate stages. Admissible arcs are continuous and 
piece-wise smooth in each stage but may be actually discontinuous at 
stage boundaries. The number of stages is bounded but otherwise not 
predetermined, since any stage will be allowed to degenerate to a null 
status. This is a generalization of the Denbow multistage extension of 
the Problem of Bolza. Appropriate imbedding theorems, a multiplier rule, 
and analogues of the tfeierstrass and Clebsch conditions are obtained. 

The theory of the second variation, the accessory minimum problem, 
and conjugate point conditions have been developed and will be presented 
in a subsequent paper. 

INTRODUCTION 

This study was motivated by the multistage character of many space- 
flight optimization problems. The problem treated is a generalization 
of the Denbow multistage extension of the Bolza problem [reference 3] • 

^This work was largely done at Vanderbilt University on NASA Research 
Grant NGR 43-002-015* The author wishes to thank Dr. M. G. Boyce for 
many helpful discussions during the performance of this research. 
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It is a generalization in the sense that the differential constraints 
may be different and different in number on the various stages, stages 
are allowed to degenerate, and discontinuities at staging points are 
permitted. The problem is approached directly as a multistage problem 
using extensions of the methods used on the Bolza problem [1 j . This 
approach avoids the transformation to a Bolza problem used by Oenbow [ 5 ] • 

Certain multistage control problems can be included in this problem 
by using techniques of Hestenese [4] and Valentine [6 j as has been shown 
for a simpler case by Boyce and Linnstaeater [2j. The applicability of 
multistage variational problems is best illustrated in a recent paper by 
Miner and Andrus [5 j • 

Three imbedding theorems, a multiplier rule, anu analogues of the 
w'eiers trass and Clebsch conditions are given. The first two imbedding 
theorems ignore the end and intermeuiate conditions and consider compar- 
ison arcs satisfying only the differential constraints. The necessary 
conditions given reduce to those for the Bolza problem whenever the 
problem degenerates to one stage. 

FORMULATION CF T AE PROBLEM 


The problem is to find in a class of admissible arcs 

y.(x); x < x, < ... < x ; x e [x , x j : i = 1, 2. ..., n: 

satisfying differential equations of constraint 

cp y» y* ) = 0| b = 1 » Pi •••1 ni < n$ a = 1, 2, ..., pj 

a 

x £ Cx a - 1- x a ]i 

and end and intermediate point conditions 

J U L V X l’ x p ’ y( x x + ) y(x p-l ) * y ^ x p ) * i = 0; 

p> = 1 , 2, ..., (j ^ ( 2n + 1 ) p +■ 1} 
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one which Minimizes a sum of the form 


p r a 

J = g[x ,y(x ),y(x"),y(x*),...y(x )j + JT I f a (x,y,y')dx. 

o poll p X J x 

a-1 

In the above statement and hereafter, y denotes the set (y^t***iy n )» 

and primes indicate differentiation with respect to x. Require y^(x) to 

be continuous for x e [x , x j - (x , x , 
op l d 


x^ and y^(x) to be 


finite non-decreasing set of points (x q , x^, • •«, x^) will be called a 
set of partition or staging points . The x q , x^, are not fixed 

but are to be determined by the minimization requirement. The left and 
right limits of y^^ and y!^ at points of discontinuity are assumed to be 
defined and finite. Variables occuring as subscripts denote partial 
derivatives and repeated indices in a product indicate summation. Let R 
be an open connected set of 2n + 1 dimensional (x, y, y T ) space with 
<p a , f a having continuous third order partial derivatives in 3. 
Furthermore, let the matrix jj have rank in R. Let S be an 

open set of 2np + p + 1 dimensional 

(x o , x lt ..., X p , y(x 0 ), y(x"), y(x+), .. ..yUVp ,y(x^_ 1 ) . y(* p )) 

space, with J^ f g having continuous third order partial derivatives in 
S. Moreover, require the matrix 

j|V Q V x •” V p J uy(x 0 ) J py(x-) J py(x*> — J vy(x p )|i 

to have rank q in S. 

A set (x, y, y f ) is an admissible set if it is contained in R. An 
admissible subarc C a is a set of functions (y^, y^, •••* y n )» 
x e [x^ x a J with (x, y, y f ) an admissible set and such that is 

Vr 


13 



MULTISTAGE PROBLEM OF BOLZA 


i, i = 1, 2,...,n. An admissible arc C is a partition set (x .x x ) 

o 1 p 

together with a set of admissible subarcs C a , a = 1, 2, ...» p, such 
that the set (x q , ..., x p , y(x Q ), y(x“), y(x*), . .., y(x^)) e s - 0n 
each admissible arc, (p a , f a , J, g, are assumed to be defined. 

ADMISSIBLE FAMILIES AND VARIATIONS 

Suppose there exists an admissible arc E satisfying tp a = J^ = 0. If 
there are no other arcs satisfying <p a = = 0 with which to compare E 

then the problem is trivial. In order to establish that the problem is 
not trivial, we will give conditions that an admissible arc £ can be 
imbedded in a family of comparison arcs. This will be the content of 
Theorems 1, 2, and 4. Theorem 4 gives conditions that guarantee other 
arcs in a neighborhood of E that satisfy <p a = 0 and = 0 while Theorems 
1 and 2 guarantee other arcs near E satisfying only tp a = 0. First, we 
need the following definitions, the first two being essentially the 
same as are given in Bliss and the third one is a multistage extension 
of the definition of admissible family given in Bliss [1, 194-195], 

A one-parameter family of arcs y^(x, b); x T < x < x”, | b | < e; is an 
elementary family if and only if y^(x, b) exist and y^x, b) have con- 
tinuous first derivatives with respect to b in a neighborhood of points 
(x, b) containing x T <x<x", |b | < e. Two elementary families are 
said to be adjacent if and only if they are defined on adjacent intervals 
and are continuous across the common end point. These definitions hold 
between partition points but not necessarily across partition points. A 
family of arcs will be called an admissible family if and only if y^(x,b) 

exist for x (b) < x < x (b), lb I < e; x (b), x_(b), ..., x (b) have 
o — — p 1 1 o 1 p 

continuous first derivatives with respect to b in the region | b | < e; 
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for each a there is a finite sequence of intervals [x\ xT], Cx^x T, 3* 
[x (k - 1) 1 x^] for k depending on a such that x 1 <x a _ 1 <b) < x" and 
x (k - 1) < x ^( b ) < x ^ k ). yi ( Xf b ) for x e Cx a-1 , x & ] is a part of a finite 
sequence of adjacent elementary families belonging to the sequence of 
intervals* 

The notation to be used for differentials of an admissible family 


is as follows: 

<*0 “ x ob dbt to l * X lb db ’ % = X pb db? 

dy ± = yldx + 6y ± where 6y i = y ib db and = y ix * 

The set of variations of the family along the arc E is the set 


•••» ^p’ defined by 


dx 


x ob (0) db 


io db ’ 


dx 


= x (°) db = j p dbj 


“p pb ~~ J P 

6y i * y ib^ X< * ^i^ x ^ db# 

The J , ..*, are constants and the t^Cx) are continuous and 

have piecewise continuous derivatives between partition points of E. 


Every set } q% } v *•*, j p » h^x) with these properties is called a 
set of admissible variations along E. 

If we require the arcs of an admissible family to satisfy * 0, 
then the variations t\^(x) along £ contained in the family for b = 0 
satisfy 


1“ (x, 1 , V) 

where the arguments of q>* and q>* . are (x, y(x f 0) , y (x, 0)) 

Py i Py i 

belonging to E* The equations = 0 are called the equations of 
variation along £• In these equations repeated subscripts indicate 
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summation. If E is specified then the coefficients of n. and n’ are 

i 'i 

fixed and independent of any family. 

,rhe equations of variation on S of the end and intermediate point 
conditions will be given by 

- v c fo + - + v p i P + Vx o > w * v. (x -) ♦ 


■ViK) V# + ••• + W) W 


i P 

where the arguments of the coefficients of the variations are the end 
and intermediate values of E. 


IMBEDDING THEOREMS 

.i/e can now state the first imbedding theorem. The proof of this 
theorem is a specialization of the proof of Theorem 2 and for this 
reason it is omitted. 

Theorem 1 . If an admissible arc E satisfies the equation <p* = 0, and 
J Q . -^l* •••» ^p» is a set of admissible variations 

satisfying the equations of variation ^ = 0 on £, then there is a 
one-parameter admissible family y^x, b) of arcs containing E, for the 
parameter value b = 0, satisfying the equations cp a = 0, and having the 

Set io* -fl* *••• j-p* V x) aS the variat i°ns of the family along E. 

The extension of this theorem to an s-parameter family is the 
content of the following theorem. 

Theorem 2 . If an admissible arc E satisfies the equations <p a = 0 and 

lf i oa » Jio* •••• i pa » n io ( x ) » (° = 1* 2, s) are s sets of 

admissible variations satisfying the equations of variation | a * o 
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along E, then there is an admissible s-parameter family y^Cx.b^bg* • • .b g ) 
of arcs containing S for the parameter values b^ * 0 (o = 1, 2, , ,♦, s), 
satisfying the equations tp a » 0, and having for each 0=1, .,,, s the 
set ^ oa » lo’ * * * ’ $ p 0 * T) i(J (x) aS the variations of the family 

along E with respect to the parameter b^. 

Proof, Let E be an admissible arc satisfying <p a = 0, and let 

p 

ioo* *lo' •••• ipo* \o M * (a = !• 2 * s > b « s sets of 

admissible variations satisfying the equations of variations ^ a = 0 

along E. Consider any arbitrary non-degenerate stage a with associated 

partition interval [x^^, x & ] . Extend the system of equations <p a (x,y,y f ) = 

0 by introducing new equations z =* tp a , (y = m +1, n), where the 

y y a 

functions (p a (x,y,y ! ) are chosen so as to have continuous partial 

derivatives of at least third order in a neighborhood of the values 

(x,y,y ? ) belonging to E* and such that q> a , / 0 along E*, 

I yy ± ’ 

Y * 1, 2, . ,,, o a , m a +1 * •••! n * 1116 Z Y are new variables, and E 8- is 

the subarc of E associated with the a stage. The equations = 0, 

P 

<p a = z^ determine functions z^(x) belonging to E 8 when y^(x) defining 
E 8 are substituted in these equations. The z^(x) are continuous except 

possibly at corners of E 8 . The equations of variations are 
_ a y a 

fp" 0 ' fr =J r 

where the functions of J are variations of z associated with the 
Y Y 

subarc E a and the variations The are dependent on i \ ^ and r^, 

so ^ corresponds to o = 1, 2, s. Furthermore, for each 

o, y icJ ( x ) is continuous except possibly at corners of E a or 
discontinuities of i^Cx). 
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The extended system of differential equations has solutions 
y i “ M i^ x * y ’ witb having continuous partial derivatives of at 
least third order in a neighborhood of the values (x, y, z) belonging 
to E 3 , since q> 3 , <p 3 have continuous third order partials. Let x 1 be 
the first value of x following x & ^ defining a corner of E a or a 
discontinuity of T]^(x), or let x* = x & if there are no corners on E a 
or discontinuities in ti . 1 ( x ). The functions z . f as defined on 

1 X0 y y o 

[x & x T ] can be extended so that they are continuous on a slightly 
larger interval. The right members of the equations 
= M i (x, y it z^Cx) + ^(x)) 

are continuous in x, y f . . . t y , b, , b and have continuous 

x n x s 

third partial derivatives with respect to the variables y^ t • • • »y n » b i* 

. ..» b g in a neighborhood of the values x, y^, y n , b^O,..., b g = 0 

belonging to E 3 where is the subarc of E a associated with 
[x a __, x T ]. Solutions 

y i = Y i^ x * x » •••» b s > 

exist for initial point (x f y^ f y^) with Y^ continuous and 

having continuous partial derivatives of at least third order with 
respect to the arguments y^, b Q in a neighborhood of the sets 
(x, x, y i$ b Q ) belonging to E a . 

The functions 

yi =T i [x - x a-r + 

define an elementary family satisfying the equations tp 3 = 0 on an 

P 

interval including [x^ x’]. 

The functions y^x, b Q ) have at x^^ the initial values 


18 


MULTISTAGE PROBLEM OF BOLZA 

y i ( Vl* V “ Y i [x a-1’ x a-i> + Vio (x I-l > * V 

= y^p + 

Furthermore, y^ b (x, 0) along have at x = x & ^ the initial values 
o 

^io^a-l) and y i (x * V Batisfies V* - 0» T* * * Y <*> + b a / (x). 

Thus along E* f (x, 0) = ij^Cx) because of the uniqueness of 

a 

solutions with initial values ti. (x . 

ho a-1 

This determines an s-parameter elementary family on the interval 

^ X a-1’ satisfying W* = 0 and having y ib (x, 0) = t^Cx) along 

Let x' T be the next value of x following x T on [x x^j defining a 

corner of n a or a discontinuity of (x), or x^ if Jt? has no other 

corners and. (x) have no other discontinuities. Repeating the 

preceding arguments produces an s-parameter elementary family on 

[x T , x T1 j which is adjacent to the elementary family on [x & x T ] and 

satisfying tp a = 0 with y. v (x, O) = t|. (x) on ^ (subarc for [x ! , x'']). 
p io^ m 

Continuing this process for a finite number of times gives a finite 

sequence of adjacent s-parameter elementary families which together 

give an s-parameter family of arcs in K satisfying the properties of 

the theorem for tne a stage with 

x _(b )=x.+b £ _,x(b)=x+b £ . 

a-1 o a-1 a ^a-la’ a o a o J ao 

By identifying the parameters of each stage with those of adjacent 
stages, an s-pararaeter family satisfying the requirements of the theorem 
is obtained. 


THE FIRST VARIATION 


Let an admissible arc E be imbedded in a one-parameter family 
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y i (x, b) with E determined by y^x, 0). Evaluate J along the family so 
that J is a function of the parameter b as follows: 

J(b) = g[x o (b) l x 1 (b), . .•,x^(b),y i (x o (b),b),y i (x 1 (b),b) 1 •,.,y i (x p (b) f b)J 

„ x ( b ) 


•1/ 


f*(x t y(x, b), y 1 (x t b))dx. 


a-1 x (b) 
a-1 


Taking the differential of J, we have 
P 


dj = dg + f a dx 
a=l 


a-1 


x 

P 

H J ^ - C 6 4 )dx - 

1=1 J x , J ± y i 

a-1 


The first variation of J along S is J where dj = Jdb with dJ evaluated 
along E. Explicitly, 


JCJ , n) = s + JZ f J 

a=l 


P r a 


a-1 


tf 

a=l J 


( f r\. + f rt , tj ! )dx 

y x 'a. 


a-1 


where the arguments of f a , f a , f a , are those determining S and 

y i y i 


S = 


g x / o + g v il + ••• + S x /p + E y.(x ) + \ (x o )] 

o J 1 P 1 o 

+ S y.(x-) [y i (x ! f 1 + ”i (x I )J + g y.(x+) il + \ (x l )] 


+ — + S y.(x p ) ip + r «i (x p )] * 


Define F a as follows: 


F a (x, yi y ! i A ) = ^ f a + 9 a 


with i a constant. 2 a function of x for each a, and u> are functions 
o u a 

cp a , <p a described earlier. Since 
P Y 


X \ + A ( $ - ^)=0, 

x ,- r • L r y 
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it can be added to the integrand of without changing the integrand’s 
value . Thus 

a x 

+ f£ 7 I + JjJ a C ^ 1 v^i-W dx * 


a-1 


a-1 


To prove the multiplier rule, we need the following lemma. 

Lemma 1. Let x e [x x ] and x , / x . If I . c. (i * 1. 2 n) 

a-i a a-l a o i 

are arbitrarily selected constants then there are multipliers ^ a (x), 
determined uniquely by 


V/ 


x _ "i 
a-1 


F* dx + c. , 

i 


which are continuous except possibly at corners of 

Proof . Following procedures of Bliss for the Bolza problem, define 


v. = F a t = ) f®, + X n (p a T . 

i yl o y ! « ^ay’ 


Now consider 


and 


dv./dx = F = ^ f + } tp 

1 y ± A o y ± A a a y ± 


v. (x _ ) = c. . 
1 a-1 1 


Further notice that the first system of equations can be solved for 
^ a (x) in terms of and v^(x). Substituting these in the system of 
differential equations gives 


dv./dx = A . v + 


B i 


The coefficients A. , B. are continuous functions of x between corners 
ia' 1 


of 


n a 


The existence of continuous (between corners) solutions v^(x) 


of this system with initial conditions v ^( x a is equivalent to 
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finding continuous functions X(x) (between corners) for the integral 

equations of the lemma. The system 

dv./dx = A. v + 1 B., v.(x )=c. 

1 ia a A o x x a-1 1 

is a linear first order system of differential equations. Let x T be the 

first corner of following x , . The initial condition then for tne 

a-1 

solution between corners defined by x’ ana x M is simply v.(x’). 
Continuing tnis, it is clear that the system has continuous solutions 
v^(x) (between corners) and hence there are continuous solutions ^ (x) 
(between corners) determined by the integral equations of tne lemma. 

The uniqueness of solutions to tne system of differential equations 
through a fixed point guarantees the uniqueness of the multipliers 
^(x). The set v^(x), and consequently the set ^(x), do 

not vanish simultaneously at a point unless they are all identically 
zero. 


He get a reduced form for the first variation by using integration 
by parts and the integral equations of this lemma* Hence, for all 
admissible variations J t r^(x) satisfying the 

equations <£ 3 =0, we have 


^ = * 0 1 , f3 / + * 0 s + X q 


a=l 


a-1 


a-1 


P r a 

- £ J K 

a=1 J x a-l 


dx. 


MULTIPLIER RULE 


We now proceed to state and prove a multiplier rule, or first 
necessary condition. 
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Theorem 3 » An admissible arc E, defined on an interval [x qI x^] 
said to satisfy the multiplier rule if there exist constants 
not all zero, and a set of functions 

Ax, y, y T , A ) = ^ Q f a + a p (x) a = 1, ...» p, 
with multipliers 3 p (x) continuous on [x q , x^J except possibly at 

partition points and corners of E» such that the equations 
x 



, a 
dx + c. . 
1 



x E [x a-l’ X a ] ’ 


is 

e 


U’ 


are satisfied on E and the equations 




- F 


'li 


dx 


a=l 


a-1 


P 

v 

hi 


A 


dy. 
y i 1 


a 

a-1 


+ ^ dg + e dJ = 0, J =0 

O ^ u V- u 

hold for end and intermediate points of E for every choice of differential 
dx , dx. f dx , dy (x ), dy (x7), dy (x|), .... dy (x ). For an arc 

E satisfying the multiplier rule the multipliers /J,_.(x) do not 

vanish simultaneously at any point of [x^, x^] , and right and let t limits 
are defined at partition points and corners of E. Every minimizing arc 
E must satisfy the multiplier rule. 

Proof . Let E be a minimizing arc for this problem. Let ^ , j£^ t » 

.... £ . ti . (x) be q + 1 sets Of admissible variations all of which 
* J px 'it 

satisfy the equations of variations = 0 along E f t = 0, 1, ...» q • 

By Theorem 2, there is a (q +1) - parameter admissible iamily of arcs 

(x - V b l V 

containing E for = 0, satisfying (p a = 0, and having / ot « •••» 
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Jpx* aS its variationc with respect to along £. The 

functions J, become functions of the parameters wnen the functions 
defining the (q + 1) - admissible family are substituted in them. The 
equations 


J(b c , • • • i b^) ~ J( 0f •••• 0) + u, 

W b l b q> - 0 

have the solution b Q = b^^ = . . . = b q = u = 0 corresponding to the 
minimizing arc d. We wish to show the determinant 


3 J .. . i J 

3 b Jb 

o q 




q 


is zero for b^ = b^ = ... = = u = 0. Suppose the contrary, then 

from existence theorems for implicit functions the above equations have 
unique solutions b (u) continuous near u = 0 and having initial values 
b^(0) = 0. But for negative values of u, the value of J on £ is larger 
than the value of J along some admissible arc corresponding to b (u) 
(for negative u). This contradicts the fact that J takes on its 
minimum value along 3. Hence the determinant is zero for all choices 
of the variations an. takes tue form 

3( ho* \o ) — * Jcq* V 

V /co’ , 'io ) •" V fcq* ’ 1 iq ) 
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where c = 0, 1, ...» p and i = 1, n and y * 1, q. 

'.Ve notice that this determinant is q + 1 by q + 1. Let t < q + 1 
be the .iiaximum rank attainable for this determinant for sets of 
admissible variations J QX * • ••* ^ t \ T (x) satisfying 

s 0 on 3. Furthermore, let this set of admissible variations be 
a set for which this rank is attained, 1'hus, there exist constants 
(not all zero) satisfying the following system of equations: 


A J( £ , Tj. ) + e J ( £ , tj. ) = 0, 

^o -'co 1 io y y Jco’ 10 

AJ(t‘ , q. ) + e J ( £ , q. ) = 0, 

*0 *cq’ iq y y J cq* 'iq 

Now, with these constants the equation 

^J ( / c , , t ) ♦ ^ Jj J c , t,.) = 0 

must be satisfied for every set of admissible variations j Q , ••• 

/p* satisfying the equations of variation = 0 along £• If 

this were not the case then there would exist a set of admissible 

variations £ 3| ' t q?(x) such that 

/ o j p 1 


v ( ic- 'V + % V /c* n i ) * °- 

,7e notice then that the q + 1 by q + 2 matrix 


3< 

Too’ 1 ’io ) ••• 

J( j 

f , TJ . ) J( 

cq* ‘iq 

Jc 

A , 



* 

V 

Jco’ n io ) •• 

• V 

icq* ^iq 

V 



"V 


has rank t + 1 since otherwise 
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But this would contradict t being the maximum rank. 

Substituting the simplified version given just before the 

statement of the multiplier rule in A J + e J = 0, we get 

o U 


a/ a / 


a=l 


V + 27 F y’ \ 


a-1 


a=l 


a-1 


x 

. p n a 

J - J" / t y dx = o. 

p p fzi J A y J r 


a-1 


The expressions not under the integrals are linear in j. 

• ^( x Q )t i ( x i ) » ^ i ( x pt •••i T l i (Xp). Consider the coefficients 


of t\ (x ), namely 

*o\tx Q ) - F J’ MX ° ))+ % J Mi (x o )* 

and recall Lemma 1. Since we can select c^ = It (x ,y(x ),y'(x ),/((x )) 

i y^ o o o o 

arbitrarily, we can make this coefficient vanish by simply setting 

Cd = - A g , \ + e J / -Similar remarks can be made concerning 

y i v o' u uy i^o' 

the vanishing of the coefficients of t;.(x*), t).(x+), t|.(x + ), ^.(x* _). 

i J- i 2 i i p— 1 

The remaining expressions must vanish for every arbitrarily selected set 

V A A- V*?’ V*? ’ 1 i ( Vi ) - W’ ^ y (x) - 

By choosing J q = ... = = t^(x^) = ... = r^Cx^) - 0 we have 

x 

P r * 

£/, v,*"' 


a-1 


Now the ^ f T * * a + 1* •••» n must vanish identically since the 
J can be arbitrarily selected. Similar choosing will show that 
the coefficients of ^ , "H i C x ” ) « • ••» ^(x^) 
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vanish. For ^ = 0, F 3 = f a and we can summarize by saying that 

the coefficients of j Q% ... f ^(x^, •••* 

T]^(Xp) vanish in the following equation: 


£ F > ' ^ * £ •** 
a-1 


+ e J = 0. 
P- U 


a-1 


Since dx Q = ^ Q db, •••» dx^ = db, dy^Cx^) * y^(x Q )dx o + Ti^(x o )db t 

dy i (x') = y^(x“)dx 1 + rj^x^db, dy^^Cx^) = y^(x^)dx 1 + q i (x^)db, ...» 

dy^(x^) * y^Cx^dx^ + this last equation can be transformed 

to the form given in the theorem. This together with Lemma 1 completes 
the proof of the multiplier rule. 


COROLLA RIFS TO MULTIPLIER RULE 


There are three important corollaries to this tneorem. 

Corollary 1 . At each point between partition points of an admissible 
arc E satisfying 


= 0, 


i*. 

y i 



dx 


X E Cx a-1* X a ]l 


the functions F a . have forward and backward derivatives, equal except 
y i 

at corner points and such that 

dF 3 . / dx = F a . 
y i y i 

Corollary 2 . At each corner between partition points of an admissible 

arc £ satisfying the equations in the hypothesis of Corollary 1, 

tne functions F a . have defined left and right limits which are equal. 
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Corollary 3 « On each sub-arc between partition points of an admissible 
arc 2 satisfying the equations in the hypothesis of Corollary 1, on 
which the functions y^x) defining 3 have continuous derivatives and 
the determinant 


a 


F® , , 

y I y k 



Vy! 


(i» k = 1, n) 


(Pi 


- 1, 


m ) 


is different from zero, the functions yl^(x), ^ belonging to 2 

nave continuous derivatives of at least the first order with respect to 

x. 

Corollaries 1 and 2 follow directly from the equations in the 
hypotheses , 

Proof of Corollary 3 « Let x be a value defining a point interior to 

some sub-arc cf the a stage with tne functions y^(x) defining 2 having 

continuous derivatives on this subarc and R* / 0 at x. The equations 

a 

/ x 

F* (x,y(x),y’(x) , ^(x)]dx 4 c®, 
y i 

x i 

a-1 

<Pp (x, y(x) , u ( x) ) =0, x e [x^, 

have tne solutions u. (x) = y!(x), p (x) = ^..(x) along 2. Notice 

1 1 K P 

the R* is the functional determinant of the left members of these 
a 

equations with respect to u. , u and R* / 0 at 

i P a 

(x, y ! (x), ^(x)) = (x, u(x), u(x)). Theorems on implicit functions 

say that solutions u. = y\ (x), \i = ;L(x) will have continuous 

i i p p 
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derivatives with respect to x near x of as many orders as the functions 

in the equations, in this case tp*, F a T , have with respect to x, u, p. 

P y i 

This guarantees then that we have continuous derivatives of y^Cx), A^( x ) 
with respect to x of at least first order. 


EXTREMALS, NORMAL AND ABNORMAL ARCS 


A stage extremal for the a stage is an admissible subarc y^x) 

without corners and with multipliers 

M 1. 2, m a , x e [x^, x ft ], 

for which y T . (x), q (x) have continuous derivatives on the interval 
1 P 

[x , x ] and satisfy equations (j> a = 0 and dF*» A - . An extremal 

a-1 a P y^/^ 


is an admissible arc which on each stage is a stage extremal. An a 
stage extremal is called non-singular provided R* / 0 along it. An 
extremal is called non-singular if each of its stage extremals is 
non-singular. 

Let M be the class of admissible arcs satisfying <p a = 0, J = 0. 

P M- 

An arc E e M is said to have abnormality of order r if it satisfies 

Theorem 3 (the multiplier rule) with r and only r linearly independent 

sets of multipliers of the form (x), 1, 2, ..., r. 

If r = 0, the arc £ is said to be normal . A set of multipliers q Ql 

q (x) with A =0 will be an abnormal set of multipliers. If a 
P ° 

normal arc E has a set of multipliers, then by dividing by a suitable 


constant these will have the form « 1, R^(x). The set of 

multipliers with q^ = 1 for a normal arc is unique, since if it had 
more than one they could be put in the form having ^ = 1, and the 
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difference of two such sets of multipliers would be a set of multipliers 
with = 0 and hence abnormal. 

A THIRD IMBEDDING THEOREM 

We can now prove for multistage problems the theorem given by 
Bliss for one-stage problems Cl, 214], The proof again follows the 
pattern of Bliss’ proof. 

Theorem 4 . If an arc E e M is normal, then there exists an admissible 
one-parameter family of arcs in M which includes £ for parameter value 
b =* 0 and which has in every neighborhood of E arcs of M not identical 
with E. 

Proof. By Theorem 2, the normal arc E may be imbedded in an admissible 

(q + 1) - parameter family of arcs x (b , b. , ..., b ), y.(x, b ,b., 

cox q i o 1 

• ••♦ b q ) satisfying only the differential equations <p® = 0. 

Consider the matrix 


3 < /co’ V 

... 

J ( 

/cq* 

, ’iq ) 

J ( £ , 11. ) 

U /co 'io 

... 

A 

V 

} cq* 

V 


and note that the maximum rank attainable for the last q rows must be 
q. For, if it were less than q, then there would be a set of constants 
^ = 0, e^ (not all zero) satisfying the equation 

^o S( jc’ V + % V V - 0 

for every set of admissible variations t , t|.(x) and determining a 

T c i 

set of multipliers - 0, /^(x) (not vanishing simultaneously) for 

E. This contradicts E being normal. Now suppose that the variations 
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, n have been chosen so that the determinant of tne first q 
Jct’ 'it 

columns of the last q rows of the above matrix is different from zero, 
and let these be the variations of tne family x c (b Q , •••* b q )» 
y.(x, b Q , b q ). Substitute into the functions replace b q by b, 

and consider the equations 

(b o’ b l b q-l’ b) = °* 

These equations have the solution b Q = b^ = = ^q-l = ^ = 0 at 

which the determinant 

dJ u ^ J U ... 

” ib q-l 

is different from zero. From implicit function theorems they have 
solutions b f = B f (b), f = 0, 1, ...» q-1. with continuous derivatives 
near b = 0 and with initial values B^(O) = 0. -^he admissible one- 
parameter family of arcs is obtained from y^x, b Q , b l( b q ) by 

replacing b q by b and b f by B f (b). This family contains E for b = 0, 
and when b is sufficiently small the arcs of this family belong to M. 
Replace the set of variations ^ cq , n iq U) by the set r\ ± (x) , 

then the variations along E of the one-parameter family are given by 

/cf B f (0) + ->c’ , 'if (x) B f (0) + V x) ’ 

where the primes indicate differentiation with respect to b. If the 

n variations 

H if (x) B T f (0) + T| i (x) 

are not all identically zero, then the family will contain arcs not 
identical with E. Now when the functions T] if have been chosen to 
secure rank q for the matrix 
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IK ( ho' ’ 1 io ) ••• ( ic q-l> \ q_x> || 

the variations can always be selected linearly independent of them, 
thereby insuring that the variations 
ri if (x) B^(0) + n i (x) 

are not identically zero. This selection can be made by determining 
the functions / if (x) corresponding to the variations T}^(x) by 
means of the equations 

= °. ■ -^ Y (x) > Y = « a + 1. .... n, 

and then selecting /^(x) linearly independent of J (f = 0,1,..., 

q-1) and finally choosing for the variations t^Cx) solutions of the 
equations 

$ p ( X * ^ V ) = 0* <£ y V ) S 

with the functions _A(x) substituted in these equations. 

Corollary 4 . If ti^(x) is a set of admissible variations 

satisfying the equations of variation <|) ^ = 0, = 0 along a normal 

arc E e M, then the one-parameter family of arcs in M imbedding E of 
theorem 4 can be so chosen that it has the set j , t^Cx) as its 
variations along E. 

Proof. The one-parameter family constructed in Theorem 4 will suffice 
for this corollary provided B^(0) appearing in the variations, all 
vanish. Consider the equations 

<yB f (b), b) a 0, )i a 1 

If we differentiate these equations, we have at b = 0 the equations 
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( h V B f (0) + K < Jo* V * O- 

Since t^(x) satisfy the equations = 0, the above equation 

reduces to 

| K ( /cf "if 5 B ' f (o) - °- 

[ Now the determinant 

j | "V ( Icf' ’'if ^ I 

1 has rank q and hence is different from zero. Hence the B^(Q) are zero. 

tie can now state another corollary to Theorem 4, and, because it 
is concerned with what happens on sub-arcs between corners of E, it is 
precisely the same result that Bliss obtained for the Bolza problem 
Cl, 21^]* We state it here without proof. 

Corollary $ . Each of the sequence of elementary families which together 
form the one-parameter family of arcs in M described in Theorem 4 and 
Corollary 4, is defined by functions 

y^ (x, b), x T < x < x tT , | b |< e, 

for which the derivatives y^t y!^ exist and are continuous in a 
neighborhood of values (x, b) satisfying the conditions x f < x < x rt , 
b = 0. If the imbedded arc E is an extremal, so that the functions 
y^(x) defining it have continuous second derivatives, then the 
derivatives y ifc)b » ^ibb* ^ibb^ 31150 exist at the values (x, b) 
satisfying x T < x < x n , b = 0, and yl^ * (y^bb^ • On each elementary 
family the following differentials exist and satisfy the equations 

^o = X ob db ’ *••• dx p = x p b db ’ dy i = y i db + 6y i’ 
d 2 y i = y!^ d 2 x + y| dx 2 + 26y!^ dx + b 2 y i * 
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WE IERSTRASS CONDITION 

We are now able to state and prove an analogue of the necessary 
condition of ’Weierstrass. 

Theorem 5 . An admissible arc E satisfying the equations ip * * 0 and the 
multiplier rule, with multipliers = 1, Ap(x), is said to satisfy 
this analogue of the necessary condition of Weierstrass with these 
multipliers if the condition 

W®<x, y, y , ^ , Y 1 ) = F®(x, y, Y' , ^ - F®(x, y, y’ , A ) 

- (Y’ - y!) (x, y, y' , ^ ) > 0 

is valid at every element (x f y, y T , A ) of E, except possibly at 

partition points of E, for all admissible sets (x, y, Y ! ) / (x, y, y T ) 
satisfying the equations tp* = 0. Every normal minimizing arc E for this 
problem must satisfy this condition. 

7/e need the following lemma in order to prove Theorem 3* 

Lemma 2 . Let E be a normal minimizing arc. Then there is a set of 
admissible variations f = 0,1,..., q-1, 

satisfying the equations of variation ^ * 0 along E such that 

| \ ( h V ’’if 5 I * °* 

Proof of Lemma 2 . Suppose that for every set of q admissible variations 

satisfying * 0 the determinant | / * O | * 0. We consider the 

eauation 1 J + e J =0 which must be satisfied by every set of 
H 'o p p 

admissible variations. The condition that j | = 0 for a set of q ad- 
missible variations implies that - 0, contradicting the normality 
of E. 
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Proof of Theorem 5 . Let a be arbitrary and consider the stage 

associated with the interval x^. As in the Bolza problem 

Cl* 220], let t be an arbitrary point between corners of E 3 . Let Y* . 

i 

i - 1» 2, . n, be a set of values such that the element 

[ v Y ’ J is admissible and satisfies the equations <p* = 0. 

This system of differential equations can be enlarged as in the proof 

of Theorem 2 so that the continuity properties described there hold 

near the element [x. , y(x ), Y T ] as well as near E* with L a .I/O 
z t | ay!^ | r 

a t Cx^_, y(x t ), Y ] as well as on E 3 • The enlarged system defines a 

set of functions z^(x) corresponding to the functions y^Cx) defining 

E a , and a set of constants associated with the set [x^, y(x t >, Y T ] . 

The equations of variation define functions j^(x), o = 1, s, 

corresponding to each of the sets of admissible variations Z . n (x) 

jqo* l i a 

of Lemma 2* As in the Bolza problem, we can infer the existence of 
three families of admissible arcs 

y ± (x, b), x^ 1 - 6 < x < x t , | b |< e, 

Y ± (x, b), x t < x < x t + e, | b |< e, | e | < e, 

y t (x, b, e) x t + e < x < x & + 6, | b |< e, | e | < e, 

satisfying differential equations 

y^ = M i (x, y, z(x) + b o J a ) f X a _ x - 6 < x < x t , 

y i s z )» x t < x < x t + e, 

y\ - M. (x, y, z(x) + b y ), x. e < x < x + 6 

i i oo T t — — a 

and initial conditions 
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y± (x a _ lt b) = y. (x a _ 1 ) + b o r, i0 (Vl>- 
Y ± (x t , b) = y i (x t , b), 

y i ^ X t + e ’ b * = Y i ^ X t + e ’ b ^‘ 

The system of differential equations is equivalent to 

<p* = 0, <P* = Z T (x) + b o n ia (x), x a _ 1 - 6 < x < x t , 

<Pp = 0, = Z , x t < X < x t + e, 

<Pp * 0, = Z Y (x) + b o ti ia (x), x t + e<x<X a +6 

with y\ ~ y(x jL ), Z). For values e > 0 the three families form 

i x t 1 ° t 

a single admissible s-parameter family of arcs consisting of a finite 
sequence of adjacent elementary families. The functions defining 
these elementary families and their derivatives with respect to x have 
continuous partial derivatives with respect to the parameters b and e. 
Continuity with respect to b follows from the arguments of Theorem 2 
and for e from well-known existence theorems in differential equations 
[ 1 , 2 ? 8 ]. 

If b = e = 0 then the first and third families reduce to the 
functions y^(x) defining the arc E*. The variations along E* with 
respect to b Q of the first and third families satisfy the differential 
equations of variation (for the enlarged system) with the functions 
y^(x) corresponding to the variations q^(x). If r ( ^(x) denotes the 
variations along the arc E 3 of the first and third families with re- 
spect to e, then they satisfy the equations 



and the relations 
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Tl i (x) = 0, x a -l " & < x - x t* 

y'(x t ) + T, 1 (x t ) = Y'. 

Cn each of the other stages one gets an s-parameter family y(x , b) 

of comparison arcs and by matching up parameters these stage-wise 

comparison arcs piece together a family of comparison arcs for the 

problem under consideration. Furthermore, when 

x (b) = x + b £ 
c ca yea 

are used to define staging points and substituted in the end and inter- 
mediate ooint constraints J , then the functions J become functions 

p ]L 

J^(b, e) of the parameters b, e. At the values (b, e) = (0, 0) these 
functions have, as in the case of the Bolza problem, derivatives 
J 3J 

Tt^ = \ ( K' V’ \ I** 

The equations J^(b, e) = 0 have initial solutions ( b , e) = (0, 0) at 
which functional determinant 

■V ( V 

is different from zero. Only the second subscript on j. , n is being 

used, actually these should read j- c0 * ^ ow the ec l ua tions 

J (b, e) =0 have solutions b = B (e) which vanish at e = 0 and have 
]i o o 

continuous derivatives near e = 0. These derivatives satisfy the 
equations 



at e s 0 , By replacing the parameters b Q by b Q = B Q (e) in the comparison 
arcs and end and intermediate conditions, we get a one— parameter family 
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of comparison arcs which contain the minimizing arc E for e = 0 and 
which are admissible for sufficiently small positive values of e. 

Now the function J can be written as a function of the parameters 
b, e as follows, 

J(b t e) * g[x (b) f .. .,x (b),y(x (b),b) f . • • ,y(x~(b) ,b f e) , . . . ,y(x (b) ,b)] 

o p o & p 

A (b) , r Vi (b) 

(xjyCx^b),^ (x,b))dx 


/ x , ^a-i 

f (x,y(x,b) (x,b))dx+. • .+ J f a “ 

x (b) J x ? (b) 

o a-c 

X t X t 

+ ^ f a (x,y(x t b),y T (x,b))dx + j 

J v j v 


X a-l (b) 


x (b) 

r a 


f (x,y(x t b) ,y T (x,b) )dx + / f^x.Y.Y* )dx 

x. 


*a + l (b) 


+ / f (x,y(x,b,e),y T (x f b,e) )dx + 


x (b) 

”P 


f S+1 (x t y(x f b ) f f (x,b) )dx 


x (b) 


/ p 

f P (x,y(x t b),y T (x,b) )dx. 


x p-l (b) 

using precisely the same techniques as are used in the Bolza problem 
we find that the derivatives of J at (b t e) = (0, 0) are defined by 
<)J 


— yr + e u 

db a U \L 


h ( he' V m °* 


a J 
a e 


+ Vu ( ic* V = y ' y ’ 


A 


Y T ) 


where 

^(x, y, y T , A , Y t ) = F*(x, y, Y 1 , A ) - f^Cx, y, y* , A ) 

- (Y! - yl) F a (x f y, y\ A ). 

1 1 y i 

The arcs defined by functions y(x, b f e) for x e [x^ x^] are 

not admissible for small negative values of e but are admissible for 
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small positive values of e. Since J(E) is to be a minimum, as e 

increases from zero, the sum J(B o (e), e) must be non-decreasing. Thus 

the derivative at e = 0 must be non-negative. The derivative of this 

sum at e = 0 is given by 

J b (0, 0) B^(0) + J e (0, 0). 
a 

It follows easily then that W 3 (x, y, y r , A , Y T ) > 0 between corners 
of E 3 . One can also see from simple continuity arguments that 
W* (x, y, y\ A t Y t ) > 0 
at corners of £?'. 

CLEBSCH CONDITION 

We follow the analogue of the //eierstrass condition by an analogue 
of the Clebsch condition* 

Theorem 6* An admissible arc E satisfying the equations <pf = 0 and the 

" P 

multiplier rule with multipliers ^ = 1, ^,,(x) is said to satisfy 

o p 

this analogue of the necessary condition of Clebsch with these multi- 
pliers if the condition 

(*. y. y' . * ) vt k - 0 

y i y k 1 K 

holds at every element (x, y, y’, A ) of E, except possibly at staging 

point, for all sets (it^, • ••? 7l n ) / (Of 0, 0) satisfying the 

equations t (x, y, y f , ^ ) u = 0. Every normal minimizing arc for 
Py i 

this problem must satisfy this condition. 

Proof . Let E be a normal minimizing arc for this problem. Let a be 
arbitrary and tu, i = 1, 2, ..., n, be a set of values satisfying the 
equations 
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<pp » (x, y, y' . •* ) n ± = o 


where the element (x, y, y\/t ) belongs to E. 


quantities n are defined by the equations 


ry. 


U, y» r * A ) ^ 


Now n - m further 
a 


The equations 

<Pp (x, y* p) = 0, <P* (x, y, p) = z y + e ^ 

have the initial solution (e, p) and determine a set of solutions 
p^(e) with initial values p^(0) = y|. Now the above equations become 
tPp (x, y, p( e) ) = 0, U, y, p(e)) = z^ + e ^ 

and differentiating with respect to e and replacing p i (0) by y^ in 
notation produces 

PiCO) = 0, <P^ P|(0) = 1 <Y , 

so p^(0) - • Now sets (x, y, p(e))are interior to R for sufficiently 

small e, hence from Theorem 5, we have 
Ax, y, y* , A , p(e) ) >0. 

Recall /(x, y, y’ t A , p(e)) = F a (x, y, p(e), A ) - F^x, y, y», A ) 

- (PjU) - y^) F a T (x, y, y», A ), 

and note Uf 8 = 0 at e = 0 giving a minimum value to .V 8 . Differentiating 

with respect to e and evaluating at e = 0 produces 

(E a (0)) ? = n.F a t - n. F a . = C 

i y ± i y ± 

and (E a (0) )” = F a . t Tt.rc, . 

y i y k 1 k 

Clearly, F a ? T be non-negative and this completes the proof. 
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by 

J. Warga 

1. Introduction . We wish to study a class of variational prob- 
lems defined by functional equations and, in particular, by non- 
linear integral equations. Special problems of this kind, in- 
volving one-dimensional "hereditary" and delay-differential 
equations were investigated, among others, by A. Friedman [1] , 

M. N. O^uztoreli [2 ], and A. Halanay [3 ] (see also [2 ] and 
[ 3 ] for other references to work on such one-dimensional prob- 
lems) . Control problems defined by multi-dimensional integral 
equations were discussed in a heuristic manner by A. G. Butkovskii 
[4 ]. The "usual" control problems, defined by ordinary differ- 
ential equations, also represent a special case. 

Among possible applications of our results, as specialized 
to integral equations, we may mention, in particular, nonlinear 
control problems defined by partial differential equations that 
are equivalent to Uryson integral equations. The methods that we 
employ are closely related to those previously developed in [5] 
and [ 6 ] . 


This research was supported by N.A.S.A. Grant NGR 22-011-020, 
Supplement 1. 
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As a convenient framework for our study we consider the 
following problem: let XJ and Q be given spaces, Xj Hausdorff, 

XX a subset of Q , the euclidean m-space, a closed 

subset of , and F : Xj x Q + ^ and 

c = (c\.. ,c m ) : X^ x q + E m given functions. The "original 
problem" consists in determining an "original minimizing 
point", that is, a point (y,u) e Xj x ]X that minimizes 
c 1 (y ,u) on the set {(y,u) e Xj x It | y = F (y ,u) , c (y ,u) eB 1 >; 
the "relaxed problem" consists in determining a "relaxed 
minimizing point" (y,q) and an "approximate minimizing solution" 

{ (Yi ,u^) >i_i / that is, a point (y,q) £ x q that 

minimizes c^(y,q) on the set { (y,q) e x q | y = 

F(y,q) , c(y,q) e B^}, and a sequence ^yi ,u i^i = i 

Xl x It such that y. = F(y., u. ) and lim c(y.,u.) = c(y,q). 

® ^ i->oo ^ ^ 

This formulation is motivated by a typical model of a 
control problem: the parameter u describes the control functions 
and parameters (that can be chosen from some "admissible" 
set \\y ) , the point y describes a motion of the system 
consistent with the chosen controls and subject to the "equation 
of motion" 

(1.1) y « F (y , u) , 
the relation 


(1.2) c (y , u) e B 1 

describes the restrictions imposed on the system, and c ^ 
is the cost functional. 
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In general , as in the special case of variational 
problems defined by ordinary differential equations , the original 
problem , with controls in It , does not admit a minimizing 
solution even if the functions F and c are "nice". We 
therefore embed It in a set Q of "relaxed controls and 
define an appropriate topology on Q in which It is a 
dense subset of sequentially compact Q and F and c are 
continuous when restricted to the set 

{ (y,q) e xQ | y = F(y,q)} . This insures, subject 

to certain mild assumptions about F and c , the 
existence of a relaxed minimizing point (y , q) and of 
an approximate minimizing solution. The desired "relaxed 
behavior of the system can be simulated by using an element 
of an approximate minimizing solution. 

In studying necessary conditions for minimum we 
require somewhat different assumptions related to the nature 
of as a Banach space, the convexity of Q , the existence 

of (Frechet) derivatives F y and c y , and the 
invertibility of I - F y (y,q) at the relaxed minimizing 

point. 

We observe, in §53 and 4, that the usual optimal 
control problems defined by ordinary differential equations 
belong to the class of problems that we have described; so do the 
more general control problems defined by Uryson-type integral 
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equations that we discuss in some detail in §§3,4,6, and 7. 

We discuss, in §2 , the following aspect of the 
general problem: (1) the existence of a relaxed minimizing 

point (y,q) e x Q? (2) the existence of an approximate 

minimizing solution; and (3) necessary conditions for a relaxed 
minimum. The corresponding proof is presented in §5 . We 
then apply these results in §§3 and 4 (with the proofs in 
§§6 and 7) to a control problem defined by a Uryson— type 
integral equation. 

The general results for the Uryson-type relaxed control 
problem that we present in §§3 and 4 require rather complicated 
assumptions and setting that are introduced with the view 
toward generality and possible applications. As a consequence, 
the theorems are rather involved and the assumptions complicated. 
We therefore present, at first, less general results that have 
the advantage of greater simplicity. 


1*3. The simplified Uryson-type control problem . Let T and 
R be compact subsets of some finite-dimensional euclidean 
spaces, dx the Lebesgue measure on T, and (t,T,v,r) -+ g(t,T,v,r): 
TxT x E n xR-*E n continuous and such that g^ are inde- 
pendent of t for i = l,2,..,m <. n. We represent by S the class 
of regular Borel probability measures on R . The "original problem" 
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consists in determining functions y: T + and p: T -*• R 

that yield the minimum of 

c 1 (y,p) = y 1 = f g 1 (x,y(T) ,p(x) )dx 
T 

among all couples (y f p) for which y is continuous, p 
measurable , 


(1.3.1) y (t) = / g(t,x,y(T) ,p(t) )dx 
T 


and 


(t e T) , 


(1.3.2) y^ = 0 (i = 2,...,m). 

(Note that y 1 are constant for i < m since the corresponding 
g 1 are independent of t) . The "relaxed problem" consists in 
determining a relaxed minimizing solution (y,a) , that is, func- 
tions y; T E and a: T + S that yield the minimum of 

c 1 (y,a) = y 1 = / ix f g i (t ,y (t) , r) a (dr ;t ) 

T R 

in the class tffc of all (y,c) for which y is continuous, 

the function x f <J>(r) <?(dr ; t) measurable for all continuous 

R 

scalar <J>, 


(1.3.3) y(t) 


f dT / g(t,x,y(T) ,r) o(dr?x) 
T R 


and 


(t e T) , 


(1.3.4) y 1 = 0 


(i = 2 , . . ,m) . 
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We can state, as a consequence of the results presented in §§3 
and 4, the following theorem: 


Theorem 1.3.5. Assume that g and g Ty = ( Bg ) (i,1 =1 , . . ,n) 

exist and are uniformly continuous and bounded on T * T * * R, 

and that the class A is nonempty. Then there exists a relaxed 
minimizing solution (y,a). 

If y is the unique continuous solution of the integral 
equation (1.3.3) for a = a then there exists a sequence 
{ p_. ) j _ ^ of measurable functions and a sequence «-Yj of 

continuous functions such that the (y^,Pj) satisfy equation 


(1.3.1) for j = 1,2,... and lim y. 

j ‘ +0 ° * 

If the linear integral equation 


for i = 1,2,.. ,m. 


w (t ) = f k(t,T)w(x)dT (t e T) 

T 

has only the trivial solution w(-) = 0 for 

k (t , x) = / g (t,x,y(x) ,r)a(dr;i) (t,xeT) then the relaxed 
R 


minimizing solution (y,a) satisfies the following necessary 
condition for minimum: 


there exist a nonvanishing X = (X A , . . , X m , 0 , . . 

. ,0) 

£ 

E 

n 

*i 

and a resolvent kernel k* = (k ^ ) ( i , j = l,..,n) 

of 

k 

such 

that ( t , t ) + k* i j(t,x) = k* i ^(x) are independent 

of 

t 

for 

i = 1 , . . , m and 
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(Weierstrass E-condition or maximum principle) 

/ dx / c(t) • g(x,0,y (0) ,r)a(dr;Gf) 

T R 


where 

and 


- Min / 5 (t) • g (t ,y,Y (9) / r' ) dx for almost all 

reR T 

C(T) = (C 1 {T ) ,t n (T)) 

C D (x) = Z X 1 k 1 .(t) + X D / [ T | (j = 1,.. ,n ; x e T) 
i = l 3 


b e T, 


(We say, in the present context, that k is a resolvent kernel 
of k if the equations 


w ( t ) = / k(t,x)w(x)dx + h(t) and w(t) 
T 


are equivalent for continuous w and h) . 


/ k (t,i)h(x)dx 
T 

+ h (t ) (t e T) 


The above theorem, which we prove in §8, is much too weak 
for our purposes: it does not even apply to control problems 

defined by ordinary differential equations. We consider, there- 
fore, in § § 3 and 4, a Uryson-type control problem in a more 
general setting: the sets T and R are assumed metric and 

compact, the "original controls" p are restricted by the con- 
dition 

p(t) e R # (t) (t e T) (where t + R # (t) C R is given) 
and the "relaxed controls" a satisfy analogous restrictions, the 
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function g has, as an additional argument, a "control parameter" 
b in a metric and compact space B, the condition (y 2 ,..>y n ) = 0 
is replaced by (y\..,y n ) e for a given B^ , the uniform 

continuity and boundedness of g and g^ are replaced by weaker 
assumptions, and the class ^ of solutions y of the integral 
equation is extended beyond the class of continuous functions . 

We then study the existence aspects of the control problem 
for integral equations assuming Xj to be h 1 (T f E^) ; and 
we examine necessary conditions for a relaxed minimum assuming 
that is either L p (T,E n ) for 1 < p < « or C(T,E n ). 

Necessary conditions for an original minimum will be discussed 
separately along the lines of [6]. We might mention, finally, 
that certain more general unilateral and minimax control 
problems that have been investigated for ordinary differen- 
tial equations [7], [8], [9] extend quite naturally to integral 
equations; but we have only partial results so far. 

I wish to acknowledge with thanks several stimulating 
conversations with J. Frampton. 

2. The General Control Problem . Lemmas 2.1 and 2.2 below are 
obvious and are stated only to motivate the corresponding state- 
ments concerning the Uryson-type control problems and their 
proofs (Theorems 3.2 and 3.3). Theorem 2.3, on necessary condi- 
tions for minimum, is patterned after [6, Theorem 2.2, p. 644] 
and relies ultimately on a construction of McShane [10, pp. 17-18]. 
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We use the term "derivative" to mean "Frechet derivative" 
and the notation h x (x 1 ,y 1 ) , (x 1 , 7 ^) to represent partial 

derivatives. If h is defined for x in a subset F of 
a Banach space X and y in a Banach space ij , 
with values in a Banach space ^ , we say that h has a 
derivative h ^ ^ ^ x l' y l^ at (x^,y^) relative to 
T x if h ^ ^ (x^y^) is a linear operator from 

X x lj to ^ such that 

|h(x 2 ,y 2 ) - Mx^yp - h ( Xf y ) ( (* 2 ,y 2 ) ~ < x i'YiU| = 

o(|x 2 - x 1 |^ +|y 2 - yj ) for all x 2 e V and 

vr 

for all Y 2 £ Xj • The symbol I represents the identity opera- 
tor on 'lj . If Q is a convex subset of a linear space, 

X is any set and h is a function from X x Q to some Banach 

space, we write Dh(x,q;q - q) for lim i(h(x,q + a (q -q) ) - h(x,q)). 

cc++O a 

We denote by A the closure of A. 

Lemma 2 . 1 Let and Q be Hausdorff spaces, Q and 

y 1 = {y e lj |y = F (y,q) , c(y,q) e B 1 , q e Q} sequentially 
compact and F and c continuous when restricted to x Q . 

Then either there exists a point (y,q) that minimizes 
c 1 (y,q) on { (y,q) e * Q|y = F(y,q), c(y,q) e } , or 

that set is empty. 

Lemma 2 . 2 Let 'll and Q be Hausdorff spaces , 2l a 
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dense subset of Q , and (y, q) a relaxed minimizing 
solution. Assume, furthermore, that Q satisfies the 
first axiom of countability and that 

(2.2.1) y is the unique solution of the equation 
y — F (y , q) ? 

(2.2.2) there exists a neighborhood Q of q such 
that the equation y = F(y,q) has at least one 
solution y for each q e U H Q ; and 

(2.2.3) the set Y 2 = (y e |y * F(y,q),q e Q} 

is sequentially compact and F and c are continuous 
when restriced to Y 2 x Q • 

Then there exists an approximate minimizing solution. 

Theorem 2.3. Let 'll be a Banach space, Q a convex 
subset of a linear space, u> D an array with real elements 

(jj 1 ^(i,j * l,...,m) considered as an element of E 2 with 

m 

origin , U = {w 1- [ 0 , I w ^ 1} , and (y*q) 

i,j = 1 

a relaxed minimizing point. Assume, furthermore, that for each fixed 
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subset {q^j | i , j = 1,. . . ,m} of Q there exists a neighbor- 
hood \J x n of (y, 0 P ) in ^ such that the functions 


'i 


( y, aT) ■> F(y,q + E ^ w 13 (q i j - q) ) : ^ ^ 


and 


x ft 


E are con- 
m 


(y, w' 7 ) c(y,q + l lj 3 (q. . “ q) ) : (^7 

i,j = 1 3 _ h 

tinuous , have derivatives at (y, 0®) (relative toX^ x fl) 

and continuous partial derivatives with respect to y on 
that 

Lj^x ft, and^the operator I - F (y q) is a linear homeomorphism 


of l 


^ onto Ljf. 


Let be a convex set in some E^, 


£ c K^, and 


(<(> 1 ,...,<j) ,n ) : K 


a continuous mapping 


with a derivative at £ and such that cf>(£) = c (y ' q) ■ Then 
either 


(2.3.1) <t>Lc)S - Min 4>i (S) C » 

or there exist Y >_ 0 and X e E m such that |x| ? 0 , 

(2.3.2) X • (c (y,q) (I - F y (y,q) ) ~ 1 DF (y ,q; q - q) + 
Dc(y,q"; q - q) } >^ 0 for all q £ Q, 

and 


(2.3.3) <y 6, - X) $-(£)£ - Min ( Y <5 - X) 4> r <£)£ / 

1 * $£K 1 1 * 

where 6^ = (1,0,. ..,0) e E^. 

3 . Control Problems Defined by Uryson-type Integral Equations . 
Existence of Relaxed and Approximate Minimizing Solutions . 

Let T, R and B be compact metric spaces. We assume that 
a nonnegative, finite, regular, complete, and nunatomic 
measure dt is defined on the Lebesgue extension of the 
Borel field of sets in T and we consider the corresponding 
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product measure dtdx on T tf T. The symbol |m| represents 

the measure of M cT, /h(t)dt the integral over T, |a,b| 

the distance in a metric space, and |a| (or |a| E ) the norm 

in a normed linear space E. We represent by L p (T,J0 

(1 _< p < 00 ) the Banach space of measurable functions h from 

T to a Banach space with the norm | h ( * ) | = { / | h C t ) | P dt}^ P 

cy/ ^' space P 

and by the Banach^of continuous h from T to 

with the norm | h ( - ) | = sup |h(t) | . We also set 

°° teT ^ 

L P (T) = L P (T, E ) and C(T) = C(T,E 1 ). 

Original and relaxed controls . 

Let be the class of measurable mappings from T 

to R. As in [5], we refer to functions from T to R as 

"original controls". Let S be the class of regular Borel 

probability measures on R, and J the class of "measurable 

relaxed controls", that is, mappings a from T to S 

that are measurable in the sense that t -► / 4>(r) a (dr; t) 

K 


is measurable on T for every continuous <j> ; R -* E^. We 
define as a subset of \J by identifying the function 

t -► p(t) with the function t + a^(t), where a^ft) is a 
measure concentrated at p(t) with mass 1. We also identify 
all controls, original or relaxed, that differ only on sets 
of measure 0. 

Topology in the space of measurable relaxed controls . 

We define a topology in ^ as in [5, p.631]; we repre- 
sent by ^ the Banach space (which is actually the space 

L^(T,C(R))) of real-valued functions 4> on T x R, continuous 
on R for every t, measurable on T for every r, with 

t -* sup | 4>(t,r) I integrable, and with 1 <f> | = / sup | 4>(t,r) | 

reR g reR 

We then define every c e as an element of Jq (the 

topological dual of ^ ) by setting <o , <J>> = /dt/ R 4>(t,r) a (dr;t) 


dt . 
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for all <j> £ The topology we choose for , and its subsets 

i and , is the weak star topology in ® (the 3 topology of 
53 ) • It follows that lim c . = o implies lim / dt / 4> (t , r) o . (dr ;t) 

i^°° 1 R 

/dt / <J> (t , r) 0 (dr ; t) for every e 
R 

Sets "ft ^ and * of restricted controls . 

For a given mapping R from T to the class of nonempty 
subsets of R, we set = {p e /t|p(t) e (t) on t} and 

^ = {a e 'S |a(R # ( t) ,t) =1 on T } , where R^(t) is the closure 
of R # (t). 

We shall consider mappings R satisfying either or both of 
the following assumptions ([ 5, Assumption 2.3, p. 631]): 


(3.1.1) For every e > 0 there exists a closed subset of 

T, of measure at least | T [ - £, such that for every t £ 

and every r £ ( t ) there exists an original control P £ , 

continuous at t when restricted to T e , and such that 
1 |p(t) , r| < €. 

! (3Tl.2) For every £ > 0 there exists a closed subset T g of 

i 1 # 

T, of measure at least |T[ - c, such that the mapping R , when 

restricted to , is continuous with respect to the Hausdorff 

distance of sets |R^(t^) , R^(t 2 ) I (where, for A, B C R, 

|a,b| * inf (h| A C U(B,h) , B C U(A/h) } and U(A,h) is the 
h-neighborhood of A in R) . Here we identify all subsets of 
| R whose mutual Hausdorff distance is 0. 


Formulation of the Ury son-type contr ol problem . 

Now let g = (g^,...,g n ), and let (t,T,v,r,b) g(t,T,v,r,l 

TxTkE x R * B + E be measurable in (t,T) for every fixed 
n n 


57 



RELAXED CONTROLS FOR FUNCTIONAL EQUATIONS 


(v,r,b) and continuous in (v,r,b) for every fixed (t,x). We also 
assume that g 1 (t , x , v, r ,b) = g 1 (x ,v,r ,b) (i = 1, . . . ,m < n) 

is independent of t for all (x,v,r,b). Let 

f (t,r, vys ,b) = 1 R g(t,T,v,r,b) s(dr) 
for all (t, x, v, b) and all s e S. We consider solutions 

(y , a, b) of the integral equation 


y ( t ) = f f(t, x, y (x ) , a (x) , b)dx (t e T) 
in x B, where is some Banach space of measur- 

able functions from T ifo 

A solution (y, a, b) is "a relaxed admissible solution" 
if (y 1 , y 2 , . . . ,y m ) e B^ (observe that y 1 (i - are 

constant on T since g X (t, x, v, r, b) (i = l,...,m) are 
independent of t) . A relaxed admissible solution (y, a, b) 
is "a relaxed minimizing solution" if y ^ £ y^ for all 
relaxed admissible solutions (y, a, b) . 


We relate the control problem just described to the general 
problem discussed in §2 in the following manner: let 
x b and Q = yft * b. We let the mappings 
(y , q) + F (y , q) = F(y, a, b) and (y, q) + c (y , q) = 
c (y , a , b) be defined, for q = (a, b) e Jj* X B and ye f by 

J 

F (y , cr, b) (t) = f f(t, x, y (x ) , a(x), b)dx (t e T) , 


c 1 (y, a, b) = /f 1 (x , y(x), a(x), b)dx 


(i = 1, . . . ,m) , 


if this defines F 
Otherwise we set. 


(y,o*,b) as an element of 


' N 

for some a e c 


•J 


a ^ 0 



F(y, a, b) = y + a, 
c(y, a , b) = 0 . 
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I 

We can easily verify that, in the case where T 

is the interval [tQ, t^] of the real axis and g (t, v, r, b) 

a special form, 

the Uryson integral equation becomes an ordinary 

I differential equation, our control problem the "standard" 

I 

control problem, and the results that follow a slight generaliza- 
tion of previous results [ 5 , Theorem 3.1, p. 633], [ 6 , Theorem 3.4, 

p. 648] . we further discuss this problem in § 4. 

We can now state existence and approximation theorems 
that we prove in §5. In both of these theorems we set = ^T, E^) . 

Theorem 3.2 There exists a relaxed minimizing solution if the 
following conditions are satisfied: 

(3.2.1) the class of relaxed admissible solutions is 
nonempty for ^ = L 1 (T, E n ) ; 

(3.2.2) R satisfies Assumption (3.1.2); 
and either 

(3.2.3) there exists a positive function i pj integrable 
on TxT and such that, for every solution (y, a, b) of the 
equation y = F(y,a,b), we have 

| g ( t , t, y(T), r, b) | <_i^(t, t) on T xt x r x b, 
or 

(3.2.4) there exist real numbers c^, p, and 3 and a 

measurable on T x t such that 0<_8<°°, l^p^ 00 , 

P 1 3, 

|g(t, t, v, r, b) | £(1 4- ! v | 3 ) i^(t,T) on T x t x e^ x r x 
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01 p/(p - 6 ) dt < 

^ and every solution (y, o , b) of the equation y = F (y , 0 , b) 

; is such that | y ( * ) | <_ c^. 

| Theorem 3.3 Let R # satisfy Assumptions (3.1.1) and (3.1.2), and let y b 
the unique solution of the equation y = F(y,a,b) for 
a = a e and b = b e B. Assume, furthermore, that 

(3.3.1) the equation y = F(y, p, b) admits at least 
one solution y in L (T , E n ) for each p e ^ in some 
neighborhood of o', and either condition (3.2.3) or condition 
(3.2.4) is satisf ied. j Then there exists a sequence f p ^ ^ ^ 

in^^* and a sequence {y^}” 1 in L 1 (T, E^) such that 

! y i = F(y i# P^, b) and lim c (y i , p^, b) = c(y, o, b) . 


4 . Necessary Conditions for a Relaxed Minimum of a Uryson-type 
C ontrol Problem . We shall investigate necessary conditions 
for a point (y, o , b) to be a relaxed minimizing solution 
in a somewhat different framework than was required in §3. 


Assumption 4.1 

(4.1.1) "2^ is 

C (T , E n ) , an( ^ t and 


either L P (T, E ) for 1 < p < 00 or 
R have the properties described in 


§3; 


(4.1.2) B is a convex subset of a linear space; 


(4.1.3) for every fixed choice of b^^ e B (i, j - l,...,m) 

2 

there exists e (0# 1/ m ] such that, for ft = 

{(u D = (u) 1 ^ ) (i , j = 1 , . . . ,m) | 0 <_ aj 13 <_ w max } C. E 2 ' the func " 

u m m . . 

tion (t,T,v,r,o) 0 ) -► g* (t , t , v , r ,u)° ) * g ( t , t , v, r ,T>+ I (b . .-b ) ) : 

i,j=l 13 

T x T x E xRxft+E has a derivative with respect to 
n n 

JL Ji J1 

(v,w Q ) , and g ff , g* , and g^c are measurable in (t,x) 
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for every (v, r, oo°) and continuous in (v, r, oj 0 ) for 
every (t,x) ; 

( 4 . 1 . 4 ) if \J ~ L P (T, E n ) then there exist measurable 
positive functions ip 0 and 4 £ on T x t and numbers a and 3 
such that 0 < a <p - 1 , 0 < ^ < p, • / *I^O ( '' T) lp^ dT < °°' 

/| *L (t ’°lp/ ( p- 1 -« ) dt < dT < 


arid, for all (t, x, v, r) e T x T x x R, 

and w a e fi # 


b. . e B, 
ID 


|cftt, t , v, r /jP ) | £ (1 + |v| g ) 4*0 (t,x) , 
|g^(t, T, V, r /jj ® ) | £ (1 + |v| a ) 4^ 1 (t/ x) , 


|g # w C x, V, r, 0J U ) I < (1 + | v| 0 I 4» 0 (t, t) ; 


if = C(T, E^) then there exists a compact set D in E n 

containing (y (t) | t e T} in its interior and integrable 4 *q 
and 4^ on T such that 

|g # (t, t, v, r ,01® ) | £ 4 ^o (?) , 


|g* <t , x, v, r,oj°) | £ 4^ (t) , 


I g c (t , x, v, r , uP) < 4j q (t) 

or 


for all (t, t, v, r) e T x T x D x K, 


b . . e B , and 
l] 


a) p e Q. Furthermore, there exists a positive function h -* | (h) 

^ # # # 

such that, for t^ , e T and g = g , g* , and g^ a , 
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/Sup | g(t. , t r v, r , u D ) g{t 2 , t, V, r ,03° ) | dx £ $(|t, , t 2 \ 

D*R*ft 


and lim § (h) = 0 ; 
h-*- +0 


(4.1.5) for k (t , t ) = f v (t, x, y(x), a( t) , b) on 
T xt, the integral equation 


w{t) = /k (t , t) w(x)dx (t e T) 
has only the solution w ( • ) =0 in 


Resolvent kernel . If there exists a measurable real matrix- 
valued function k ■ (k !) (i, j ■ l,...,n) on T x t such 
that, for every x e 2^ , the relations 

w(t) = / k(t, x)w(T)dx + x(t) a.e. in T 


and 


w(t) = / k (t, x)x(x)dx + x(t) a.e. in T 


are equivalent in 
of k . 



* 

we refer to k 


as a resolvent kernel 


We can now state necessary conditions for a relaxed 
minimum in Theorem 4.2 below. Conditions (I), (2), and (1) 

of (4.2.2) are generalizations of respectively the Weierstrass 
E-condition, transversality with respect to parameters and 
initial conditions, and transvarsality with respect to the end 
conditions of the calculus of variations. 


Theorem 4 . 2 Let (y, o, b) be a relaxed minimizing solution, 

Ji 

and let Assumption 4.1 be satisfied. be a 
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denumerable subset of # R (t) = (p(t) | p e (t e T) , 

R (t) the closure of R (t) , S (t) = {s e S| s(R (t) ) =1}, 
a convex set in some E^, f e K^, and <J> ; a 

continuous mapping with a derivative at £ and such that 
♦ *(£) = c 1 (y , a, b) = y * (i = Then 


(4.2.1) there exists a resolvent kernel k of k 

*i 

such that k ^ is independent of t for i = l,2,...,m; 

and / 1 k (*,x) | ^ ^dx < 00 if 18 L p (T, E n ) and 

/ sup | k (t,x ) | dx < » if = C (T , E )? 

te T <h 

(4.2.2) either <|>i(?)I « Min <t«J(T) C , 


or there exist a nonvanishing 


SeK 


1 ^ = 


X = (x A ,...,x“) e E n 


and y > 0 such that, setting 

X = (x 1 ,...,x m ,0,...0) = ( x,0 0) e E n 

C j (x) = Z \ i k*^ ( t) +-^- (xeT, j=l,.. 

i « i 3 |t | 

C (T) = (C 1 (T) C n (X) ) <reT), 

h 1 (s, e) - /c(t) • f ( t , e, y(e), s, E)dx 

and 


,n) , 


( 


(s ,0 )ft 5xT) , 


H 2 (b) * f T X T C(x) * Df < T ' e ' y( 0 K <?( 0 )' b ? b - b)dxde (beB) , 

the following conditions are satisfied: 

(The Weierstrass E-condition) 

(1) H. (a ( 0) ,0 ) * Min H 1 (s , 6) = 

1 S£S*(0) L 

Min / £(x) • g(x, 9, y(0), r, b)dx for almost 

reR* (0) 

all 9 c T , 


(Transversality Conditions) 

( 2 ) Min H~(b) = 0, 
b e B ^ 
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and 

(3) (ySn “ (£)£ = Min (y6 , - X)<j> r (£)£, where 

* CeK x 1 ^ 

^1 ~ ^ ^ r ^,.*.,0) e . 

In particular, if R # (t) = R for all t e T, R * ( 0) 
* 

and S (0) can be replaced by R and S, respectively, 
in relation ( | ) . 


An illustration . As an illustration, we shall apply Theorem 4.2 

to the following "standard" relaxed control problem: let T be 

the closed interval [ t Q , t x ] of the real axis, dx the Lebesgue 

measure on T, B, C E , B a convex subset of some E„, d> 
i m £ 0 

a continuously differentiable mapping from B into E^ with 
the image Bq , and h: T x E^ x R E^. We wish to determine 
functions x: T + E m and a: T + S that yield the minimum of 
x 1 (t^) among all absolutely continuous x and all measurable 
(in the previously defined sense) a that satisfy the relations: 

— r- — */h(T / x(T) f r)a(dr?T) a.e. m T, 

R 

x(t Q ) e B q , x ( t x ) e B . 

We set n = 2m, g = ( g 1 , . . , g 11 ) , y = (y 1 ,.. ,y n ) , v = (w 1 ,w 2 ) 
with w^ f W 2 e E^, and, for all (t,T,v,r,b) eTxT*E n *R*B 
an d 1 = 1,2 , • • , in , 

y (t) = x (t) , 
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y i (t) « x i (t i ) , 

h(x f w 2 ,r) + 4> 0 (b)/(t 1 -t Q ) for t <_ t 

g 1+m ( t , x , v, r , b) = 

<j> 0 (b)/(t 1 -t Q ) for t > t 

g 1 (t ,x , v,r,b) = g 1+m (t 1 ,T f v,r / b) . 

We then observe that our new problem is formally equivalent 
to the Uryson-type control problem considered in §3 and in the 
present section. We can easily verify that Theorem 4.2 is ap- 
plicable if we set = C{T # E n ) and assume that 

(x ,w, r) + h (t ,w, r) and h w (x,w,r) 

exist on T * v r, are continuous in (w,r) and measurable in 
x, and that |h(x,w,r) | and |h w (x,w,r) | are bounded by an 
integrable function of i for all (w,r) £ D * R, where D is 
some compact set in E^ containing the trajectory {x(t) |t e T} 
in its interior. 

* 

We can evaluate the resolvent kernel k by a straight- 
forward (if somewhat tedious) computation and determine that 

C j (T) = . ■ 1 . X j = — — — r — z j (t,) (x e T, j = 1 , . - ,m) , 
^ l"^0 10 

C^ +m (x) = -dz^(x)/dx (x e T, j = l,..,m), 

where the absolutely continuous function x z(x): T -* E m is 

the solution of the system 
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= z ('t)=-A T (t)z(t) a . e . in T # 

z(t 1 ) = A, 

A is the transpose of A, and A(t) - f h (x ,x (t) , r) a (dr ; t) (xeT) . 

R w 

It follows then that 

H (t),r) = z(9) • / h(9,x(9) , r) a (dr ; 9) + 1 z(t-> -x(t n ) (9e:T,reR; 

1 R ]TT 

Thus relation (1) of Theorem 4.2 yields the familiar Weierstrass 
E-condition (maximum principle) for the relaxed control problem 
defined by ordinary differential equations. In a similar manner, 
relations (2) and (3) yield the support ( transversality ) condi- 
tions at the initial and terminal points t Q and t^, respectively. 


66 


RELAXED CONTROLS FOR FUNCTIONAL EQUATIONS 


5. Proofs of the Statements in §2 . The proofs of Lemmas 2.1 
and 2.2 are trivial and will be omitted. 

Proof of Theorem 2.3 We first consider the special case 

where ={ (b 1 , . . . , b m ) | b^ = 0 (i = 2,...,m)}. 

Consider the equation 

_ m . . __ 

(5.1) y * F (y , q + Z 0 ) 1=1 (q. . - q) ) 

irj = 1 3 

for an arbitrary choice of q° = (q^j) with e Q. We 

can apply, with minor changes, the proof of the implicit 
function theorem [II, p. 265] to show that there exists a 
neighborhood Y x n of (y, 0°) relative to x ft such 

that equation (5.1) has a unique solution y = n(^ D f q*) e Y 
for every o) a e ft and the function -► n (io^, q°) : ft -► Y 
is continuous and has a derivative at 0 fl . It follows that 

ID 0 + c(w D , q°) = c(n(w , q a ) , q + £ ur 3 (q.. - q) ) : 

if j = 1 3 

a 

is also continuous and has a derivative at 0 . 
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Now let 6 11 = 0, 0 i -’ = 0 { ( i , j ) yi (1,1)), q^ = g(i,j = l,..,m), 

h ( 0; q) = c(6°, q°) , V = {dh(0; q)/de | q e Q) , and let W 

be the convex cone in E generated by V; that is, 
m m . in 

W = { Z a 1 v. | v. e V , a 1 >_ 0 } . We shall show/vJie sequel 
i = 1 

that there exists X e E m such that |x| ^ 0, X >_ 0 , and 

X’w >_ 0 for all w e W. 

If this last statement is true, then x * dh(O;q)/d0 >_ 0 
for all q e Q. We have dh(O;q)/d0 = 

c (y^q)n ^ (0 n , q D ) + Dc (y , q; q - q) . Also, the differentiation 
of both sides of the equation 3°) = F(n(9 tJ ; q u ) , q + 0 {q - q) ) 

with respect to 0^ at 0 D yields 

n ^(0°, q°) “ F (y, q)n H ( ° C ' + DF(y, q; q - q) . 

We then conclude that 

(5.2) X ■ dh(0;q)/d6 = X * {c y (y,q)(I - F y (y ,q) ) “ 1 DF (y ,q; q - q) 

+ Dc(y,“q; q - q) } >_ 0 for all q e Q- 

We now proceed to prove that there exists a point x as 
just described. Indeed, assume the contrary. Then it follows 
from elementary properties of convex sets that there exists 
a point w = (w'*’, 0 0) in the interior of W, linearly 
independent w^ e W and positive a 1 (i * l,...,m) such that 

1 m i 

w 1 < 0 and w = Z a w . . 

i = 1 

By the definition of W, there exist points q^ and numbers 
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a 1 * 1 (i, j = such that a 1 - 1 >_ 0 and 



dh ( 0 ; 


q i; j)/de 


(i = 1, ... , 111 ) . 


Since the are independent, the matrix (w?) (i, j = l,...,m) 

is nonsingular. 

Now let f = {y e E |0<y 1 < Y TTiav (i = l,...,m)} , where 
m max 

Ymax i- s positive and sufficiently small so that tL) C (y) = 

(o) 1 ^(y)) = (Y 1 a 1 -^) e 5, and consider the function y+My) = 

c((jo d (y); q p ) : r -► E . This function is continuous and has 

a derivative k^,{0) = Ok (0 ) /By 3 " t * * * / 9k (0 ) /?y m ) at 0 relative 

to f (where 9k(0)/Oy 1 are right-hand derivatives). Furthermore, 

a m n m y . 

3k(0)/3y^ = I dh ( 0 ; q..)/d0 • (0 ) /3y X = I dh(0;qa,)/de * a X 3 

i,j-l 13 j = 1 /3 

= w^ ( Jl - l,...,m); hence the derivative k^,(0) = ( w ?) has an 
inverse and k^(0)ot = w = (w 3 " , 0 , . . . , 0 ) for a = (a 1 , . . . , ot m ) . 

It follows (as in [$, p. 650]) that there exists a solution 
€ + y(c) of the equation 


k { y (€ ) ) - k (y (0 ) ) = ew 

for all sufficiently small positive c, and y 1 (G)— » +0 (i = l,..,m). 

_ m . . + 

There exist, therefore, q = q + I co 1 - 3 (y (e) ) (q . . - q) e Q 

irj-1 13 


and y = r\{m a (y (e)); q n ) 0 ^ 


(i 


such th; 


that y = F(y,q), c 1 (y,q) = 0 


2,...,m), and c (y,q) < c (y, q) , contradicting the 


assumption that (y,q) is a relaxed minimizing point. 

We conclude that when = { (b 3 " , . . . ,b m ) | b 1 = 0 (i = 2,. 

there exists a point A e E such that j A j f 0, X^ _> 0, 


,m) } 


and relation (5.2) is satisfied. 
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We now consider the general case and define the sets 
and and the functions F^ : C'j x and 

c# : Q# * E ro +1 b * 

= Q x K^, bJ = {(v°, v\...,v m ) I v i = 0( i = l,...,m)} 

F # (y,q,S ) = F (y ,q) , 
c # ° (y ) = c 1 (y ,q) , 

c # ^(y,q,£) = cAy^q) - <p'^(Z) ( J= i,..,,m). 

Clearly, the point (y,q,X) is a relaxed minimizing point 
for the problem defined by bj, , B^ , F^ and c# , which is 
of the form just investigated. The conclusions of the theorem 
follow from relation (5.2) applied to the transformed problem; 
the details of the argument are as in [6, Proof of Theorem 2.2, 
p. 6503 . QED 


6 . Existence of relaxed and approximate minimizing solutions, 
for Uryson-type problems. Proofs. 


Lemma 6.1 Let conditions (3.1.2) and (3.2.3) be satisfied, 


and let 


(y £ 




F(y, a , b) , a e jj* , b e B} . Then every 


sequence {y i > i ^ in Y 2 has a subsequence converging to 
some ye - L 1 (T, E n ) . 

' n 


Proof; Let 


F(y i 7 a i 9 b i } (i 


1, 2 ,...,), y^ e 


a. e * and b. e B. We must show that there exist a sequence 


j of natural numbers and a point y in 


lim y, = y in 


ieJ 


V- 


V 


such that 


Let, for v e E n / X(v) = 1 if |v| £ 1 and 
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x(v) = -L if |v| > 1 , 

|v| 

g ( t , x, v, r, b) = x(|g(t, t, v, r, b)|/iMt, T))g(t, x, v, r, b) , 

and 

f(t, x , v, s, b) = / R g (t , t , v, r, b)s(dr), 

for all t, x, v f r, b and all s e S. Then, by (3,2,3), 
g(t, x, y(x), r, b) = g(t, x, y(x), r, b) on T x t x r x b 
for every y e Y^? hence every solution (y, a, b) of the 
equation y = F(y, a, b) also satisfies the equation 


(6.1.1) y (t) = / f(t, x, y(x), a (x ) , b)dx (t e T) . 


Furthermore , 


|g(t,x, v,r, b)| ( t , x ) on txtxe^xrxb 

and g is continuous in (v, r, b) and measurable in (t, x). 

NOw let ji(t) = /^(t, x)dx on T, S N = {v e E n | | v| <_ N}, 

P N = {t £ T | iMt) <_ N}, and c > 0. Then there exists 

N = N (e ) such that, for P = P , . 

N (C ) t 

(6.1.2) f dt/ T _ p ^(t, T)dT < |c. 


Since q is measurable in (t, x) , continuous in (v, r, b) 


onthe compact set x r x ] 

the restriction of g 1 to 

i = l,...,n, an element of 

exist, therefore, an integer 

a r l 1 ^T x P, e t and 6, i 
j n' - 


1, and | g ( t , x, v, 

r , b) | 

< l|>(t 

T x p x S., x R x B 
N 

is, for 

each 

L 1 (T x p f C (S N x R 

x B) ) ; 

there 


k = k(e) and functions 

C(S kt x p x b) such that 
' N 


T) , 
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(6.1.3) f f Max | g(t,i,v,r,b) 
S >t xRxB 

N 


k 

Z 3 . (v,r,b) a. (t,x) 

j=i 3 3 


dtdx 



Furthermore, each oij e L (T x p f E n ) can be approximated 
by a finite sum I b^(x) a^(t) , where b^ are measurable 
characteristic functions on P and e L (T, E n ) . We 

conclude, therefore, that relation (6.1.3) can be rewritten, 
by appropriately changing the definitions of k and 3 j , as 


A. "I 

(6.1.4) / / Max |g (t ,T,v,r ,b) - Z 3 • (v,r ,b) b . (x) a . (t) | dtdx <_-j€, 

1 F S xt xrxB j = l 3 33 q 

N J 

and we may also assume that |3j(v,r,b) | < 1 on S^xRxb. 

Now let 

Tji = Y j i (e ) = / p b^ (x)dx / R 3j (y ± (t) ,r,b i )a jL (dr?x) . 

We observe that 

|y i (t) | £/|f(t,x,y i (x),a i (x),b i ) | dx < f\p (t,x)dx <_N = N ( € ) 

for t e P = P (e) and all i = 1, 2, 3.... Therefore, for 
all integers p and q, for all t e T, and for k = k(c), 

|y p (t) - y q (t)| < 2/ T _p i)i(t,T)dT + |/ p dx{/ R g(t,T,y p (x) ,r,b p )a p (dr;t 

- g (t,T,y (t) , r,b )a (dr 

k K g g g 

1 2/ T _p i|»(t,T)dT + | £ (Yjp - Yj q ) a j (t) | 

k 

+ Zf Max | g (t , x ,y . (x ) , r ,b . ) - Z 3 . (y . (t) , r ,b . ) b . (x) a . (x) | dx ; 
i=p,i=q F reR 1 1 j=l 3 1 1 3 3 

hence, in view of relations (6.1. 2) and (6.1.4), 


(6.1.5) / |y <t> - y (t) |dt < j C + 


k 

I 


3P 


- Y jq |/ T |a.(t) |dt + 2 £ * 
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Given any infinite subsequence J of {1, 2,...}, we 
can determine a subsequence = J^tJ, c) such that the 

sequences have a limit, for each j = 1, 2,...k(e) 

since lY-^l 1 l p l 1 l T l for a11 1 and 3- Now let 

Jq = {1, 2, . . . } , J ^ + -^ = J ^ jL 9 ^ ^ ^ ^ ^ = ® ' i ' 2,.,.), 

and let J be the diagonal subsequence of Jq, 

Then converges for each e > 0 and j = 1,2,..., 

and there exists an integer i Q = i Q (e) such that, in view 
of (6.1.5), 

/ T |y p (t) - y q (t) I dt l c 

provided p > q > i Q (£) and p,q e J. 

We conclude that is a Cauchy sequence in 

L^(T, E^) and converges, therefore, to some y in L^"(T, E n ) . 


k(e) , 


QED 


Lemma 6.2 Condition (3.2.4) implies condition (3.2.3). 


Proof. Let (y, a, b) satisfy the equation y = F(y,a,b), 
and assume that condition (3.2.4) is satisfied. Then, for 
t e T, 

| y (t ) l £ | / T dx/ R g (t,T ,y (t) ,r,b)c (dr; t) | < f ( 1 + |y (x) | B (t , T ) dx 

and, by Holder's inequality, 

1 y (t) I < j> 0 (t r T)dT + |y(-) I® k 0 (t,-) l p/( p_e)i I V*' - ) lp/( P -B) |T l f 

l’»'o (t '- )l p/(p-B) °1 - ^V^-^p/te-B)' where y=^ + l T ! B/P - 
It follows that, for all (t, t, r, b) e T * t x R * B, 

|g(t,T, y(T), r, b) I < (1 + Y 6 ko (T, • ) |p/ (p _ B) J+o (t- T ) 
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Now let the expression on the right be denoted by 
let c 2 ={/|i|' 0 (t,- ) 1 p/ ( p _g ) dxj^and let 

^ (t / T ) = |p/(p_g) Then, by Holders inequality. 


f'p' (t ,t) d T < k 0 (t,-) | p/(p _ 6) {/|* Q (t,-)| 


and 


Also 


ffty ^ (t , t ) dtdT £ | T | (p ^ 


p/ (p-6) 


dt } 


3/p 


■^TxT^O dtdx < °°* It follows that ( t / t ) •+ i^(t,T) 

is integrable on T x T. QED 


6.3 Proof of Theorem 3.2 . Because of Lemma 6.2 it suffices 
to assume that conditions (3.2.1), (3.2.2) and (3.2.3) are 
satisfied. Now let { (y^ , , b^)}^_^ be a sequence in 

X X B and y L = F(y i( , b^. By (3.2.2) and 

[S, Theorem 2.5, p. 6 32] the set # is sequentially compact, 
and by Lemma 6.1 there exists a sequence J and a ye ZJ^ 
such that lim y^ = y . We may choose J so that lim a . = a 

4/# - 

and lim b. = b for some a e ^ and b e B, and 

ieJ ^ 

lim y = y a.e. in T, say for t e T / . 
ieJ 

For e^ch fixed t and t in T / , g(t, x, . , . ) is 

continuous, hence uniformly continuous, on the compact set 

x r x b, where is a compact subset of containing 

y(x) in its interior. It follows that lim g (t ,x ,y . (t) , - ,b. ) = 

ieJ 1 1 

g(t,T,y(x) , * ,b) uniformly on R and 


lim / (g (t ,x ,y, (x) ,r ,b . ) - g ( t, x ,y (x ) , r ,b ) ) a . (dr; x) = 
ieJ 1 1 1 

lim a. (t,x) = 0 for all t,x e T Furthermore, 
ieJ 
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| CU (t,x) I < 24 > (t,T) on x T 7 and is integrable; 

therefore, for each t e T^ 

y (t) = lim y. (t) = 
ieJ 

lim / T dx/ R (g(t / T,y i (T) , r^) - g (t ,t ,y (t) ,r ,b) ) (dr ; T ) 
ieJ 

+ lim / dx g(t,x,y(T) ,r,b)a i (dr;t) 
ieJ 


= lim / T dx ^ g (t ,x ,y (x ) ,r ,b) (dr; x) 
ieJ 

= / T dx / R g(t,x,y(x) , r , b ) a ( dr ; x ) = (t ,t ,y (x) ,a (t) ,b) dx , 


since the function (T,r) •+ g (t ,t ,y (x ) ,r ,b) e *2> (as defined in §3). 

Thus y(t) = F(y,c,b) (t) for t e T 7 . By redefining y 
if necessary, on T - T / , we can assert that y = F(y,a,b) and 
thus the set of solutions of y = F(y,a,b) is nonempty and 
sequentially compact in ^ ^ * B. Since y^ = c^(y,a,b) 

(j = l,...,m) for every solution (y,a,b), the y^ (j = l,...,m) 
are constant, and B 1 is closed, it follows that there exists 
a minimizing relaxed solution. QED 


6.4 Proof of Theorem 3.3. By Assumptions (3#l. 0 a [f, Theorem 2.4, 
p. 631], the set is a dense subset of There 

exists, therefore, a sequence converging to 

a. By (3.3.1), there exists an integer i Q and a sequence 

{y.}°°_. in }J such that y. - F(y. , p . , b) . It follows then, 

1 1_1 ° ^ 1 lj 

as in the proof of Theorem 3.2, that there exist yet/ 


ana a sequence 


_U j_l 4_ 1 ' m , 7 

au^u Liiau x i “ i t 


lim « — 


and y = F (y ,o ,b ) , 


ieJ ^ie J_ 

By the uniqueness assumption, y = y. 
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Thus lim c 3 (y . , p. , b) = lim y? = y 3 (j = since 

ieJ 1 1 

the y? (j = l,...,m) are constant. QED 

7. Proof of Theorem 4.2. We shall use the notation and the 
assumptions of §4. We also set, for a fixed choice of 


(i/j * in B and a i: *(i,j = in 


J*, 


e ft, 

v e E n' 

y e 

m 



b + E 

£ 

n 

H 

3 

- b) 

if j= 

l 13 

m 



C + E 

u^ 3 (a . . 

- a) 

if j- 

1 13 


f(t,x,v, 0 J D ) = f (t ,x ,v,a(t; 0 ) D ) , b (oo D ) ) y 
F(y,aj a )(t) = ff (t , t ,y (t ) , co c ) dx . 


Lemma 7.1 Let b.^. and - (i # j = l,...,m) be fixed. Then 

~ i in some neighborhood r of (y/0'j 

(y,w D ) + F(y f tjj D ): 1 x n + LA i s continuous^and has a derivative 


at (y, 0 D ) , the partial derivative F 


0 ° ) 
itiv 

exists and is conti 


nuous on T7 an ^ t ^ e following relations hold: 

(F y (y,w 0 )Ay) (t) = / T f v (t,x,y (t) ,u)°)Ay (x)dx (te Tlye^Ay^, 

/0 d ) (t) - / T f (A)0 (t,T,y (t) ,0 p )dx (t e T, y e , 

(Fy (y , 0 n ) Ay) (t) = k (t , t ) Ay (x ) dx (t e T, Ay e , 


, u 


and 


F ^y^O^ (t) - / T f (t,x f y (x) ,a 11 (x) - cr(x),b)dx + 


/ T Of (t,x ,y (t) , c(x), b; b ^ - F)dT (t e t). 


eft) , 
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Proof: We first consider the case = L P (T, E n ) for 

1 < p < co. Let y e ^ and u)\H be fixed. We observe that 
the function (t, x) + f (t , x ,y (x) ,w lt ) is measurable on 
T x T and / | f (t ,x ,y (x ) ,u) ) | p dt £ 

/|^ 0 (t,T)| p (1 + |y(T)| e ) P dt < k 0 (-,T)| P (1 + |y(t)| 8 ) p < - 
for almost all x e T. Thus the function t + f (t,x,y (x) ,u> 0 ) 
belongs to L p (T,E n ) for almost all x e T and [9, Lemma 16, 
p. 196] x -► f (■ ,x ,y (x) , 0 )*) is a measurable function from 
T to L P (T, E n ). Furthermore, x -► l+|y(x)|^e L p ^^ (T) 
and x -v k 0 ( *' x) lp e L p/,(p “^(T); hence, by Holder’s inequality, 

/ | f ( • , x ,y (x) , 0 )^) | p dx < / | (* ,x) | p t 1 + |y(x)|^)dx < ®. 

Thus x ■> f ( • ,x ,y (x) , 0 )°) is an integrable function from T 

and oj^e £2, 

to L P (T, E n ) for all y e L P (T, E n ) and F exists on 
L P (T, E n ) x Q . 

Now consider the continuity of F and the existence and 
continuity of F y . We have, for fixed y e and u) a e Q, 

and for all Ay e ij^, 

(7.1.1) F(y + Ay,u> 0 ) (t) - F (y , w 0 ) (t) * 

/{ f(t,x,y(x) + Ay (x) , w a ) - f (t ,x ,y (x) f oP) }dx = 

/f v (t,x,y(x) + 9 (t,x)Ay (x) ,a> D ) Ay (x)dx a.e. in T, 

where 0 £ 0(t,x) <_ 1, and we may assume (using essentially 
the argument in [13, Lemma 18.1, p. 177]) that (t,x) -► 9(t,x) 
is measurable. Furthermore, 
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| f v (t,T,y (t) + e (t,T) Ay (t) ,U3°) I £ 

(1 + <|y(T) | + | Ay (x) |)°)t|i 1 (t,T) ? 
hence 

/ | f v (t,x,y (t) + 0 (t ,t) Ay ( t) ,o)°) | p dt < 

(1 + ( | y (t) | + |Ay(x) |)“) p |K Ii (-,t) |P . 

It follows that t ■+ f (t,T,y(x) + 0 (t ,t) Ay (t) ,w d ) 

belongs to L P (T,E 2 ) for almost all t e T and 
n 

|F(y + Ay, oo n ) - F(y,w D )| p <_ 

/( 1 + ( | Y (t ) | + | Ay (t) | ) a ) \ty 1 (• ,t) | p | Ay ( t ) | di 

We can easily verify that, for a fixed y in y, the 
coefficient of | A y ( x ) | in the integrand on the right has an 
L P/(P 1) n orm bounded by some constant c^ for all Ay in 
the unit ball of L p (T,E n ). 

We conclude that 

|F(y + Ay,w D ) - F(y,w D ) | < c.^ | Ay | 

for all Ay e L P (T, E n ) and e ft. Thus y -*» F(y,co°) 

is continuous at every y, uniformly in w 0 e ft. 

Our previous argument shows that the function 
Ay -*■ i9^(Ay; y) = /f ( • , t ,y (t ) , u> n ) Ay (t ) dx is a bounded 
linear operator on L P (T, E ) for every (y,oj a ). Relation 
(7.1.1) now yields 


78 


RELAXED CONTROLS FOR FUNCTIONAL EQUATIONS 


(7.1.2) |(F(y + Ay,!!) 0 ) - Fty.ui 0 ) -/^(Ay; y))(t) | < 

< f Sup|g*(t,T,y(T) + 6 (t , T ) Ay ( T ) - g* (t,T,y (T),r/) | I Ay ( x ) ( dx. 

Rxft 

As Ay converges to 0 in L P (T, E n ) , hence also in measure, 
the coefficient of | Ay ( t ) | in the integrand on the right 
converges to 0 in measure, as a function of t, for almost 
all t e T. This coefficient is also bounded by a(t,x) = 
i|> 1 (t,T)(2+ |y (t) |“ + ( | y (x ) | + | Ay ( t ) | ) a ) , and we verify 
that t - |a(t,-) | /( 1} belongs to L P (T). It follows, 

applying Holder's inequality to the right side of (7.1.2) 
and then taking the L p -norm with respect to t, that 

lim |F(y + Ay,u)^ - Fty,^ 17 ) - ^(Ay; y) | /|Ay| = 0 

I Ay Ip -*■ o p p 

for every y e L P (T, E n ), uniformly in oj° £ (1 ? hence 
Ay; y) = F (y , w a ) Ay (Ay e L P (T , Ej ) , and 
Fy(y,wtO is the operator Ay -*■ ? v ( • r t ,y (t) , 0 )°) Ay (t )dx . 
Thus F y (y,w°) and F y (y,0 D ) have the form indicated in the 

statement of the Lemma. 

The argument we have used to prove the existence of F y 
via inequality (7.1.2) and Assumption (4.1.3) can be 
used to show that 

iFy^ ,<D n ) - F y (y 2 ,u)°)| -*■ 0 as y 2 ♦ in 'ijj uniformly 

in a) 13 . Thus y + F(y,w I:7 ) and y -*■ Fy(y,co^) are continuous 
at each y, uniformly in e ft. Similar arguments show 

that w n + F(y,u)°) and w* 1 Fyty,^* 7 ) 
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are continuous for each ye , whence we conclude that 
(y -*■ F(y,io^) and (y ,m u ) ■* F^(y,u f - i ) exist and are continuous 
° n ^L* n ' Finall y- the existence of F (y, 0°) follows 

from that of f jj(t ,t , y (t ) , 0 ) and the bounds in (4.1.4). 

w 

Thus (y,u D ) -<• F(y,a)°) has a (total) derivative at (y, 0°). 

The same conclusions can be reached by similar arguments 
when ~lj = C(T, E n ) . QED 

Lemma 7 . 2 The mapping I " F (y, o, E) is a linear homeo- 


morphism of onto and statement (4.2.1) is valid. 

Proof : We have shown in Lemma 7 . 1 that 


(F y (y, a, E")Ay) (t) = /k (t,x ) Ay (t) dx (t e T, Aye \jj . 

By Assumption 4.1, k is measurable on T x t and |k(t,T)| 

is bounded by ji(t,T) = (1 + |y (t) | °) ^ (t, t) for 1,^= L P (T, E n ) . 

We verify then, as in Lemma 7.1, that / | $ ( • , x ) | ^ <P -1 ) dT < 

It follows [tt, p.518] that F y (y, a, b) is a compact operator 

on L P (T, E n ). Similarly, if ^7 - c(T, E n ) , the family of 

functions t -+ fk (t , t ) A y ( x) dx corresponding to all Ay such 

that Max | Ay (t)| < 1 is uniformly bounded and has the common 
t e T 

modulus of continuity $. Thus, in both cases, F y (y,a, b) 
is a compact operator. It follows, therefore, from (4.1.5) 
that I - F y (y,a,b) is a linear homeomorphism of onto 

X/ Theorem 5, p. 579]. 




Let K = F y (y , a , b ) and 


= (I 


K) -1 - I. 


For E n ) » arguments of [lH,pp. 157 - 160] 

(applying to the case n = 1, p = 2) can be suitably generalized 
* 

to prove that K is an integral operator such that 

(K Ay) (t) = fk (t,T)Ay(x)dT (t e T , Ay e J^) , where k is as 
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described in (4.2.1). (These arguments, in their generalized 

form, are based on approximating the function t + k(*,x) 

in L P// D (T, L P (T, E n ) ) by finite sums of the form 

£<Xj (t) 8 j ( * ) ) • Finally, since k^ (i - l,...,m) are independent 

of t and K = K + KK , the k ^ (i = l,...,m) are also 

independent of t. 

For C C ( T ' E n ) * we observe that since K = K + KK 
and K is compact, so is K . There exist, therefore [15* , 
Proposition 9.5.17, p. 665], a measurable k ff = (k .) (i, j = l,...,n) 
on Txt and a nonnegative regular Borel measure y on T 
such that 

(K Ay) (t) = /k* (t ,t) Ay (t) y (dx) (t e T, Ay e '^/) 

# * 

and Sup j* T | k (t , x ) | y (dx ) < Our conclusions about k 

teT 

will follow directly from the Radon-Nikodym theorem once 
we prove that, for all t e T, the measure A ■* / k (t,x)y(dx) 
is absolutely continuous with respect to our original measure 
A -*■ / A dx. This we can do by observing that if K Ay^ r-^> 0 
in ^ , so does K Ay^ = (I + K )K Ay^ ; and then considering 
any sequence {A^} of Borel sets in T such that |A^| 0 

and "approximating" their characteristic functions with continuous 
functions a^ such that 0 <_ a^(t) <_ 1, a ^(t) =1 on , 

a^(t) = 0 on T - , where O A^ O are closed, 

are open, and y (G^ - C^) + j.G^ - | •> 0 as i -+ 00 . QED 

7 . 3 Completion of proof of Theorem 4.2. Lemmas 7 . 1 and 7 . 2 
show that Theorem 2.3 is applicable to the control problem as 
defined in §4, and that statement (4.2.1) is valid. We have, 
for q = (a ,b) , a 11 = a , b ^ = b ; 
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DF(y f q? q - q) = DF(y,U,b ; (Q/b) - {o , b)) - F 

CO 

and 

c 1 (y#q) = F i (y,q) (i = 

Let K* be the linear operator on V defined by k and 

let X = (X,0,...,0)e E r . Then K* = (I - F y (y,q)) 1 - I 
and, applying Lemma 7.1, relation (2.3.2) can be rewritten as 

X‘{F y (y,q) (I - F y (y,q)) _ 1 DF(y,q; q - q) + OF (y ,q; q - q) } 

(7.3.1) 

= X- (I + K*)DF(y,q; q) 

= I X j /_{f j (0,y(6) ,o(6) -o(6),E) + Df' j (0,y (9) ,a (6) ,b;b-b) }d0 

j=l 

+ E E k 4 (t ) { f 3 (t , 9 ,y ( 9 ) , <3 ( 0 ) -o (0 ) , E) +Df 3 (t , 0 ,y ( 9 ) ,0 ( 6 ) ,b ;b-F) }dxd 

i=l j-1 TXT 3 

=/ T de/ T c ( t) • C f ( t, e,y (6) ,o (0)-?(0) ,b)+Df (t,0,y (0) ,a (8) ,b;b-E) }dT 
= f T H 1 (a(0),0)d0 - ; T ^(ate) ,0)de + H 2 (b) > 0 

for all (o,b) 0 : J* * B. In particular, for a = a, 

H.(b) > 0 = H, (b) for all b e B. 

2 ^ ^ 

It remains now to prove relation (I). Let ' s tf£ eo = 
i e {1, 2,..}, E be an arbitrary measurable subset of T, 
b = b, and a (t) = Pi (t) for t e E, a (t) - a(t) for 
t e T - E. Then relation (7.3.1) yields 

/_ {H n (p. O) ,9) - H (a (0) ,0) }d0 >0, 

h. 11 
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where H 1 (r,0) = H 1 (s r ,0) and s r is a measure concentrated 
at r with mass 1. It follows that for each i there exists 
a subset of T , of measure |t|, such that 

H^(p^(0),0) >_ H 1 (o"(0 ) , 0 ) for all 0 e T\. 

oo 

Then, for T /= f] T. , 
i = l 1 

(7.3.2) H 1 (r,0) = ; T c (t) -g(T,0,y (6) ,r,b)di > H 1 (a(0),0) 

/ * 

for all 0 e T' and r e R (0). We verify, using properties of 
* 

k described in (4.2.1) and the bounds on g described in 
Assumption (4.1.4), that t ■* £(t) * g (t , 0 ,y (0 ) ,r ,E) is 
bounded for all r and almost all 0 by an integrable function 
of t. Since, furthermore, it is also continuous in r, we con- 
clude that relation (7.3.2) is valid for almost all 0 and 
_ * 

all r c R (0) and, integrating both sides with respect to 

* 

any s e S (0), that relation (I) is valid. 

n Ji 

When R ff (t) = R on T, we may choose as ^ m any set 
of constant functions from T to R whose images form a 
dense subset of R; then R (t) = R and S (t) = S for all 
t e T. QED 
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8. Proof of Theorem 1.3.5 . Those parts of Theorem 1.3.5 that 
refer to the existence and necessary conditions follow directly 
from Theorems 3.2 and 4.2 whose assumptions are weaker. The 
statement asserting the existence of "approximating" sequences 
{p_.} and { y j } will follow from Theorem 3.3 if we can prove 
that the equation y = F(y,p,b) admits at least one solution 
y in L^(T,E n ) for each p This last statement follows 

from the fixed point theorem; indeed, for each p, the mapping 
y **■ F (y , p , b) is continuous in C(T,E n ) and, because of the 
boundedness and the uniform continuity of g, the image of this 
mapping is contained in a convex and compact set of functions in 
C(T,E n ) with a common bound and a common modulus of continuity. 
QED 


References 


[1] A. Friedman 


[2] M.N . Oguztoreli 


[3] A. Halanay 


[4] A . G . Butkovskii 


Optimal control for hereditary 
processes, Arch. Ration. Mech. Anal. 

15 (1964) , pp. 396-416. 

Time-Lag Control Systems , Academic 
Press, New York, 1966. 

Optimal controls for systems with time- 
lag, S.I.A.M. J. Control 6 (196 8) ,pp . 215-2 34 . 

The maximum principle for optimum 
systems with distributed parameters, Autom. 
and Remote Control 22(1961), pp. 1156-1176 . 


84 


RELAXED CONTROLS FOR FUNCTIONAL EQUATIONS 


[51 J. Warga 

m 

[7] 

[&] 

m 

[!0] E.J. Me Shane 
[II] J. Dieudonne 


Functions of relaxed controls, 

S.I.A.M. J. Control, 5(1967), pp. 628 - 641. 

Restricted minima of functions of 
controls, S.I.A.M. J. Control 5(1967), 
pp. 642 - 656. 

Minimizing variational curves 
restricted to a preassigned set, 

Trans. Amer. Math. Soc. 112, (1964), 

pp. 432 - 455. 

Unilateral variational problems 
with several inequalities, Michigan 
Math. J. 12(1965), pp. 449 - 480. 

On a class of minimax problems in 
the calculus of variations, Michigan 
Math. J. 12(1965), pp. 289 - 311 

Necessary conditions in generalized - 
curve problems of the calculus of 
variations, Duke Math. J. , 7(1940), 
pp. 1-27. 

Foundations of Modern Analysis, 

Academic Press, New York, 1960. 


[\l] N. Dunford and 
J. Schwartz 


Linear Operators, Part I, Inter- 
science, New York, 1964. 


[133 


M.A. Krasnoselskii 
and 

Ya. B. Rutickii 


Convex functions and Orlicz 
spaces, P. Noordhoff Ltd., Groningen, 
1961. 


[IS] F. Riesz and 
B. Sz. - Nagy 


Functional Analysis, F. Ungar 
Publ. Co., New York, 1955. 


[l£j R.E. Edwards Functional Analysis, Holt, 

Rinehart and Winston, New York, 
1965. 


Northeastern University 
Boston, Massachusetts 


85 



ORIGINAL MINIMIZING CONTROLS FOR INTEGRAL EQUATIONS 


By J. Warga 

Professor, Northeastern University 
Boston , Massachusetts 

NASA Grant NGR 22-011-020 


87 



I 

! 

ORIGINAL MINIMIZING CONTROLS FOR INTEGRAL EQUATIONS 
| by J- Warga 

1. Introduction . We consider a class of variational problems 
defined by the Uryson-type integral equation 

: y(t) = (y 1 (t) , . . . ,y n (t) ) = / g (t,x,y (t) ,p (x) , b) y(dx) (teT) 

I T 

| where p is chosen from a given set °f "original" (unrelaxed) 

| controls and b from a given convex set B of control parameters, 

j We have investigated, in [1] , a related problem in which the 

set was imbedded in a set of 

| measurable relaxed controls, and have discussed the existence of 

a minimizing relaxed control, its approximation by original 
controls, and necessary conditions for a relaxed minimum. Since, 
as it is well known from the control theory of ordinary differentia 
equations, the existence of a minimizing original (unrelaxed) 
control cannot be assured, except under very restrictive conditions 
we begin the present study with the a priori assumption that there 
exist an original control p e 91 and a parameter b e B that 
yield a minimizing solution of the variational problem in x B. 

We then show, applying certain results of [2] , that the necessar 
conditions for minimum derived in [1] (generalizations of the 
Weierstrass E-condition and of the transversality conditions) re- 
main essentially valid in the present context. Our present 
results are limited to the case where T is the closure of a 
bounded open set in the Euclidean i-space E^ and y is 
absolutely continuous with respect to the Lebesgue measure (wherea 
in [1] T was only assumed to be metric and compact, with an 

This research was supported by N.A.S.A. Grant NGR 22-011-020. 
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appropriate measure); within this context, however, the present 
results generalize the necessary conditions of [1, Theorem 4.2] 
in that the given class of controls may, but need not, consist of 
relaxed controls and the remaining assumptions are also slightly 
weaker . 

References to other related work can be found in [ 1 J . 

2. Necessary conditions for minimum . Let T be the closure of 
a bounded open subset of E^ , R a metric space, B a convex 
subset of a real linear space, a closed subset of E^ , 

n > m , g = (g\...,g n ) , and (t,T,v,r,b) + g(t,T,v,r,b) : 

T x T x x R x b -* . We assume that g 1 ( t , t , v, r ,b ) = . 

g 1 (i,v,r,b) (i = l,...,m) are independent of t. 

For p: T ■+ R , p^: T R (i = l,...,k) and disjoint 

subsets of T , we define p = [p^ , (i = l,.,.,k); p] 

T + R by p(t) = (t) for t s A^ (i = l,...,k) and p{t) » p (t) 

for t £ T - A. . Let ft be any class of Lebesgue measurable 
i=l 1 

mappings from T to R with the property that, for every set 
A that is a finite or denumerable union of intervals in E^ , 
( p l, e ^' p 2 implies [p ± , A ; p 2 ] e ft. 

Let y' be a Lebesgue integrable scalar function on 

T (viewed as a subset of E^) , with y' (t)> 0 on T , and let 

y be a positive measure defined on the class of Lebesgue measurable 

subsets of T by the relation y (A) = J*y'(x)dT , where dx refers 

A 

to the Lebesgue measure in E^ . Let be either the Banach 

space L p (T , y ; E n ) (of functions from T to E n ) for 
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1 < p < co or the space C (T , E n ) (of continuous functions from 
T to E n ) , each with the usual norm. We consider the integral 
equation 

(2.0.1) y (t) = f g(t,T,y (x) ,p(x) ,b)y (dx) (t e T) 

T 

for (p,b) e B. A point (y , p ,b) e Jr** x B satisfying 

equation (2.0.1) is an "admissible" solution if (y ,y ,...y m ) e 
B^ (observe that y 1 (i - are independent of t) . 

An admissible solution (y,p,b) is a "minimizing" solution if 
y^ < y^ for every admissible (y,p/b). 

Our purpose is to derive conditions satisfied by a 
minimizing solution (y,p,b) that generalize the Weierstrass E- 
condition (the maximum principle) and the transversality conditions. 


We shall use the term "measurable" in the sense of the 


Lebesgue measure on E^ when referring to subsets of T or 

functions on T ,and in the sense of the corresponding 

product measure with respect to T x T. We represent by |a| 

the norm of an element of a normed linear space. If CC and 

^ are Banach spaces , rc% and x+h(x):r ^ , we define 

the derivative h x (x^) as a ^-^ near °P era tor from 7C to £ 

such that | h (x) - h (x^) - h x (x^)(x - x^) | = o(|x - x^| ) for 

all x e T . We denote by h , , h , h the derivative 

lx ,y ; x y 

and the partial derivatives, respectively , of a function 
(x,y) -► h(x,y) from a subset of a Banach space to a Banach 
space. The symbol I represents the identity operator on 
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#<X.3 ) the linear space of bounded linear operators from 

a Banach space 3C to a Banach space ^ with the metric topology 

induced by the operator norm, A° the interior and A the 

closure of A , and an array (oo 1 - I )(i , j » l,...,m). If 

h:% * B -► ^ , where ^ is a Banach space, we write 

Dh (x , b^ ; b - b^) for the one-sided derivative 

lim i(h(x , b, + a (b - b, ) ) - h (x , b, ) ) . 

a ++0 a 1 1 1 


Assumption 2.1. For every fixed choice of b , with elements 
b 1 ^ e B , the following conditions are satisfied: 


there exists 0 e (0 , l/m^ ] such that, for 
max ' ' 


( 2 . 1 . 1 ) 

- tT = {0°|O £ 0^ £ 0 } c E 2 » the function 

1 mav m z 


7(b e 


(t , t , v , r , 0°) -► g # (t , t , v , r , 0°) = 


m .... 

+ E e 13 (b 13 - b) ) : 

i . 3 = 1 

has a derivative with respect to 


g ( t , x , v , r , b 

T x t x e xRx^ 
n 

o # # # 

(v , Or ) everywhere, and g , g v and g^c* are measurable in 

Q 

(t , t) for every (v , r , 0 ) and continuous in (v , r 
for every (t , t) ; 


0 °) 


(2.1.2) (1) if = L P (T , y ; E ) then there exist 

measurable positive \p and on T x T and numbers a 

and $ such that 0 £ a < p -1 , 0 < 3 < p , 
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/ { / |)|i (t , x) | p y (dt)} 1/(p ~ 6) y(dx) < » , 

T T ° 

f { / |i|> (t , t) | p/(p 1 a) vi(dT)} (p " 1 a) u(dt) < », 

T T 1 

l f { / | ^ , ( t , t) | p y (dt) > 1//(p 1 a 'y(dT) < “ , and, for 

T T L 

all (t, T, v, r , 9°) eTxTxE n xRx»77 

| g (t , t , V , r , 6°) I < (1 + | v 1 8 ) i^ o (t , x) for 

| # # 

I § = g and g 0e 

and 

|g*(t , T , v , r , e G ) | « (1 + I v | 04 ) ( t , t) ; 

(2) if = C (T , E^) then there exist a compact set D 
in E containing {y(t)|t e T} in its interior and an 

! n f # 

integrable scalar ip on T such that, for § = g f g v , and 
# 

9 qO , we have 

i 

> |g(t, t , v , r , 0°) | < ip (t) 

for all ( t , t , v , r , 0°) eTxTxdxrxJ^. Furthermore, 

there exists a positive function h + 0(h) • such that 

# # 

lim 0(h) =0 and, for t, , e T and g = g and g v , 
h++0 

f Sup |g(t. , x , v , r , 9°) - g(t 2 , x , v , r , e n )|y(< 

d x r x y 

^ o ( | t 1 - t 2 | ) ? 
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(2.1.3) for k (t , x) = g v (t , t , y(x) , p (t) , b) 

on T x T f the integral equation 

w(t) = / k (t , t) w(t) u(di) (t e T) 

T 

has only the solution w(-) = 0 in . 


2.2. Resolvent kernel . It follows from Assumption 2.1 (and 
can be proven exactly as in [1 , Lemma 7.2]) that there exists 

★ ★ i 

a measurable real matrix-valued function k = (k j ) (i , j * 1,.. 
on T x t (a resolvent kernel of k) such that, for every h 
the relations 


w(t) = / k (t , x)w(x)y(dx) + h(t) 
T 


(t e T) 


and 


w(t) = / k (t , xUi(x)y(dx) + h(t) 
T 


(t £ T) 


are equivalent in 0 , k^ (t , t) = k^ (t) are independent of t 

for i » 1,. . # m , / { / |k (t , t ) | p y (dt) ^ y(dx) < 00 

T T 


for 


^=L P (T , y ; E n ) and / Sup |k"(t , x)|y(dx) < 


for 


t e T 
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= C(T,E n ). 

We can now state our basic results. 

Theorem 2.3 . Let (y , P , b) be a minimizing solution, let 

* 

Assumption 2 . 1 be satisfied, and let k be a resolvent kernel 
of k. Let be a denumerable subset of ^containing p , 

R*(t) = { p Ct ) |p £ ^H 00 ) (t £ T) , K 1 a convex subset of some 
Eg , t £ K 1 , and $ : K 1 -*■ a continuous mapping with 

a derivative at | and such that <{>(?) = (y\---fy m ) • Then 

either * Min ; 

5 5 £ K 1 * 

or there exist a nonvanishing X = (X^ , . . , A ra ) £ and 

y ^ 0 such that, setting 


X = (X 1 ,.. f X m ,0,...,0) = 

(X , 0,. 

. . ,0) e E n , 


m • * - 




C 3 (t) = E X x k * (t) + 

X j /V(T) 

(t e T , j = 1 , . . . ,n) , 


i = 1 3 




C(t) = (C 1 (T),..,C n (T)) 


{T £ T) , 


H. (r , t) = / C (t) ,.g (t , 

T , y(i) 

, r * b)p(dt) (t e T , 

- r e 


T 


and 


H 2 (b) = / C (t) -Dg (t , t ,y (x) , p (t) ,b ; b-b) p (dt) p (dt) (b e B) , 

z TxT 
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the following conditions are satisfied: 
(The Weierstrass E-condition) 


(1) H 1 (p(x) , t) = 

t in T , 

(Transversality conditions) 


Min * 
r e R (t) 


H l<r, t) 


for u-almost al 


(2) Min H (b) = H (b) =0 , 

b e B z 


and 


(3) (y<$ -i - X) $,-(£)£ = Min (y 6, - X)*<J> {£)£ , 

* Z e K 1 5 


where 6 ^ = (1 , 0,..,0) e . 


In particular, if 
constant functions from T 
R in relation (1) . 


R is separable and contains 


to 


we can replace R (t) 


all 

by 


3. Proof of Theorem 2.3. 


3.1 Regular sequences, admissible controls, the sets T. 

* * II 

and R (t ) . Let | A | = / dr represent the Lebesgue measure of 

A 

and S (A 


ACT , diam(A) the diameter of A 


6-neighborhood of A. A sequence {M.} 


j = 1 


, 6 ) the closed 
of closed subsets 
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of T is "regular" at t (covers t in the sense of Vitali 
[3 , p . 212 ] ) if diam(Mj) +0 as j + » f t e H . , and 
| S (Mj , 3 diam(Mj ) ) | * a |Mj | ( j = 1, 2 , . . . ) for some a > 0 . 

For any y-integrable function x -*■ f (t) from T into 

* 

some Banach space, let T^(f) be the set of all the points t 

i * i 

in T such that |f(t ) | < °° and 

a = lim — / f(x)y(dx) = f (t* ) 

Vt (Mj ) M 

for all sequences } that are regular at t . Since 

a = lim r / f (x)y' (xJdxA-i-. f y'(x)dx , it is well 

j + °° ' j ' M. 1 j 1 M. 

J J : J j 

known (proof as in [3,Th. 8, p. 217]) that |T^(f)| = |t|; hence 
v(T'(f))- y(T). We write T'(f x , ± 2 , ) for 

T u (f l } ^ T l^ 2 )n • ■ • • 

If L P (T , y ? E n ) then it follows from Assumption 

2.1 that, for all p e , the function g(p) defined by 

g(p)(x) = g ( * , x , y (t) , p(x) , b) (x e T) 

is a y-integrable function from T to L P (T , y ? E^) . We then 
set 

(3.1.1) T* = H t'(g(p) , g ( p ) ) 0 T° . 

P e *K 
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If ^ = C(T , E n ) then, for all p e a , the functions 
g-^p) and g 2 (p) from T to E m / defined by 


g^(p)(T) = g 1 ^ , y (t ) , p (x) , 


b) 


(i = 1 , . . ,m,T e T) 


and 


gi (p) (t) = / 
1 T 


n 

£ 


k* 1 (t)g^ (t 
1 3 


T 


y (T) ,p ( t) , b)y (dt) 

(i = 1, . . ,m , t e T) 


A 

are y-integrable (since t-Hc ^(t) are y-integrable on T for 
i = and |g(t , t , y(x) , p(t) , b)|< iP(t) for all 

t , t e T) . We then set 


(3.1.2) 


n 


P e 


ft 


T ' (§i<p) ,g 2 (p) »§ i<p) ,g 2 (p) »’(’) A t°. 


Thus, in both cases, |T*| = |t| and y(T*) = y(T). 


* * 

We also set, for each t e T , 


R (t ) = {p (t*) | p e $2^} . 


3.2 The collection JY and the function G . Let 

N. = , (2 _1 , 2 -i + X ] (k = 1 , 2 , . . , m 2 ) , 

0*i=k(mod in ) 

6>0, [0,6] j = [0,6] x...x [0,6] (j times), 

N k6 = (N k^ [0,6])x[0,6] 4 " 1 , and Y k (t) = Sup v ( (t+N^nT 0 ) 


(k— 1 , , 1 
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We define (t , a) (t e T° , k = 1, ,m 2 , a > 0) as 

(t + N k ^) f~) T° , where fS is chosen so that u(N k (t,a)) = Min (a , y k (t)} 

We set = {N k (t , a) |t e T* , a > 0} (k = -1 , 2 , . . ,m 2 ) 

and t^= {^|k = 1,2 m 2 ). 


\ 

I 

I 


We can easily verify that, whenever ^ a j^j - ± 

sequence decreasing to 0, the sequence (N, (t , a.)} 0 ? 

K 3 3 

* o 2 

regular at t for every t e T C T and k=l,2,..,m . 

For any fixed choice of t^ e T , e , 

(i/j - l,..,m) , let 3*=^ { h ff ) (as in Assumption 2.1) 


is a 
■ 1 is 

and c 

, and let 


B 


Q = n ( t 4 *) 


{gj* I * 0 (i,j = 1 


,m) and the sets 


N mj - m + i^ 1 ^ ' w 1 - 5 ) are disjoint } . 

We set p ' (u) ) = , N m j _ m + i (i , j = 1, . . . ,m) ; p] and 

# 

define g as in Assumption 2.1. Finally, we verify as in 
[1 , proof of Lemma 7.1] that there exists a neighborhood 
T = Ty x x T 0 Of (y , 0 ,0 * ) in * A * \f such that the 
relation 


G(y ,o) ", 0 *) (t) = / g # (t,T,y (t) , p' iw # ) ( t ) ,6*)p (dx) (t e T) 
T 

defines a mapping G: T ■+ 

Lemma 3.3. The functions (y,o>*, 6*) -► G (y , a) 6 *) : 
r v and (y,u a ,e < ') -»■ G y (y ,w" , 6 C ) : r 
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i 

i 


exist and are continuous. We have, for all (y , to a , 0 e T and 
iy e ^ , 

(3.3.1) (G (y ,U)“ , 6°) Ay) (t) = / g* (t,T ,y (t ) , p ' (o'") ( t ) , 6°) Ay (t ) y (dT> 

Y T v 

(t e T) . 

Proof . The arguments of [1 , Lemma 7.1] show that, for fixed 
to a , the function (y,0*) •* G ^(y , to®, 6* ) : x ) exists, 

is continuous, and satisfies relation (3.3.1). Now let to* and 
to* be in and set 2 = { e T|p'(to*)(t)fp'(to 2 )(t)}. Then 

y (M, 2 ) * 2 | and, by (3.3.1), 

' i/j = 1 

( (Gy (y f to*, 6* ) - G y (y , to* i B 4 ) ) A y) (t) = 

- f (g* (t,x,y (x) ,p' (w*) (t) ,0') 

M. - V « * 

' L ' - g* (t , t ,y (t) f p ' (top (t ), 0) ) * Ay (t) y (dT) 

((y,a/,0*) e T , Ay e p , t e T) • 

For L^(T,y;E n ), we have, therefore, in view of Assumption (2.1.2), 

A 1 ,2 = I (G y (y ' W 1 ' q/,) “ G y (y f0) 2 J A yl p * S 2 1 ( • , x) | p | Ay (x) | y (dx) 

M l,2 

where the function x -*• l^l^*' T ^ip = { / | ^ (t , x ) | ^y (dt) } belongs 

to L p// ^ (T , y ) ; hence 
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h 6 1 if I'M* |P /(p_1) y (dT)} 1-1/ P • | Ay | . 

' M J. p p 


Since u (M^ 2 ^ ^ ^ as u 2 w l" ' ^ follows that A^j/I^ylp 
also converges to 0, uniformly in (y,0 ) • Thus, when 

L P (T,y;E n ), the function a) 9 ■> G y (y , u>*, 0* ) : 
is continuous, uniformly in (y,0*) , and we conclude that 

the function (y,o>*,0*) + G y (y ,u*, 0*) is continuous in ^x Q x 

For C (T , E^) , we have 

I ((Gy (y*w',e*) - G y (7,^2 ,e tf ) ) ) Ay) (t) |< / M ^(T)u(dx) • I Ay| ro , 

1,2 

and the argument can be continued as in the previous case . QED 

Lemma 3.4 . Let L p (T,y;E n ). For fixed y e T y , the function 

(co*,0*) -► G Q * (y ,co *, 0 *) : x r Q $ (E^ ^/) exists and 

is continuous, and we have 


(3.4.1) 


G q a (y,w* ,0 r ) 


/ g!o (• ,T,y (t) ,p' (to*) ,0® )y (dx) (t e T). 
T 0 


Proof . The existence of G Q o(y,w*,0 # ) , relation (3.4.1), and 

the continuity of 0° + G^a (y , co °, 0* ) follow from Assumption (2.1) 
and, in particular, the continuity of 0 # g 0 „ and the bounds. 

The continuity of oo* ■> (y, u a t §°) , uniformly in (y,0*) , 
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can be shown by the same arguments that were applied in Lemma 3.3, 
with G 0 <* replacing G y . QED 

Lemma 3.5. Let ^ - L p (T,y;E n ) and let 0 a be the origin 
of E m 2 . Then the function (y,u)*, 0*) -+■ G(y,a>*, 6*): T + ^ has 
a (total) derivative at (y,0*,0*) (relative tor), and 


(3.5.1) j (y » 0*,0* ) = g(-,t ij ,?(t ij ) ,p ij (t ij ) ,b) - 

- g(* ,y (t 1 ^) ,p (t 1 ^) ,E) in 

Proof . If follows from the bounds in Assumption (2.1.2) that 

Ji 

T Ig (• ,T,y ( t) ,p' (w*) (x) ,0 # ) | is y-integrable for all 
(y,o) # , 0*) e T . Thus 


G(y,oj # ,e°) 


/ g # (* ,x,y (t) ,p' (w*) (x) ,0*)y (dT) 
T 



and 


G(y,u',0*) - G (y , 0 **, 0 ) = Z / (g ( * , x , y (x ) , p 1 j (x ) , b ) 

1 ' j =1 M ij 


“ g ( • r x ,y (x) ,p (x) f b) )y (dx) 


for M ±j = M i j(co lj ) = N mj _ m+i (t lj ,w lj ) (i , j-1 , . . ,m) . Since, 
for each fixed i and j , the sequence {M d^ a k^ ^k=l * s 

regular at t 1 - 3 if -► + 0 , the are disjoint, t ^ e T* and 

y (M : ) =y (M^ = oj 1 * 3 for sufficiently small w*- 3 (i , j*=l , . . ,m) , 
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relation (3.5.1) follows directly and we also conclude that 
ui * + G (y , oj 0 *) has a derivative at 0*. 

Since, by Lemma 3.4, (u) # ,0*) -> G^* (y , /, 0*) : x r Q ■> 53 (E m2 r^) 

i ^ 

is continuous and there exists a convex neighborhood of 
(0*,0*) relative to x r e • we conclude that the function 

(oo°,6 a ) + G(y,u) D ,0 o ) has a derivative at (0*,0 tf ). Finally, 

I by Lemma 3.3, (y,co 4, ,0^) + G^ (y , w*, 0*) : T ) is 

| continuous; hence (y , uj d , 0° ) ■►G (y f t») 0 , 0 ) has a derivative at 

i (y , 0° , 0° ) . QED. 


3.6 Proof of Theorem 2.3 for 


L p (T,E,) (l<p<°°) . 


By Lemmas 3.3 and 3.5, the function (y,w # ,9*) -► G(y,w* f 0^): 
r ^ has a continuous partial derivative with respect to y and 
a (total) derivative at (y,0^, 0^) . Furthermore, Assumption (2.1) 

implies (see [1, Lemma 7.2] for details) that I-G (y , 0 n , 0*) is 
a linear homeomorphism of onto ^ and that 


(3.6.1) 


(I-G (y,0 9 r O 0 ))~ 1 = I + K* , 


where 


(3.6.2) (K Ay) (t) = j^k (t ,t) Ay (t ) y (dx) for all t e T and Ay ejf* 

It follows then from a variant of the implicit function theorem 
(using essentially the same arguments as in [4 ,p . 265]) and from 
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the representation of Gy,G Q <* and c in Lemmas 3.3, 3.4, 
and 3.5 that the equation 


y = G (y 6 tf ) 


has a unique solution 
in 2^ for all 0*) 


n (u B ,e®) = (n 1 (to',0') , 

in some neighborhood 


in T x r such that the function (tu ,0 ) 
w y 

continuous in A and has a derivative at (0 


...,n n (u*,e')) 

A of (0°,0°) 
n (w , 0 ) xs 
*,0*) , and 


(3.6.3) 


Vi (0 ' 0#) 


(I-G y (y-O'o") ) _1 (g(- (t 1 ^) ,b) 

g(- ,t ij ,y(t ij ) ,p(t ij ) ,b) y t 


(3.6.4) n 0 ij (o',o') = (I-G y (y,0 < ’,0°)) -1 /^gg ij (-,T,y(T) ,p(T)p")y(d 

for i,j=l,..,n. 

Now let a function (p,b) -► ^p,b): % * a - E m 

be defined as follows: 
if the equation 


y (t ) = f g{t,x ,y ( t) , p ( t) , b)y (dx) 


T 


has a unique solution y(*) 



we set 


(t e T) 
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x(p,b) = (y 1 ,...,y m ) 

(remembering that y 1 (i=l , . . , m) are then independent of t) ; 
otherwise we set 

x(p,b) = (y 1 + 1,0, ... ,0) . 

We also set, for all t* e T* , (t*) = . 

i a - m . . . . 

We observe that, in particular, x 1 (p ' (<d ) ,b + I (b^-b ) 

I , j— I 

= n 1 (u) tf ,0°) in A (i=l,...,m). 

We can now verify that (p,b) yields the minimum 

of x 1 (p,b) on {(p,b) e ^ x B|x(p,b) e , that (T 

define "local variations for x in n x B at (p ,£) " 

according to [2, Definition 2.1, p. 644], and that Theorem 2.2 

of (2, p. 644] is therefore applicable. Furthermore, defining 
Dx(p,b;t*,r) as in [2,p. 643] , we have 

(3.6.5) Dx i (p,b?t ,r) = n (0*/ 0* ) (i=l,..,m,t e T , r e R* 

where n is defined by choosing t^ - = t , p^ such that p^ (t ) = r 
and the other t 1 - 3 ^ 1 ^ and b* arbitrarily; and 

(3.6.6) Dx 1 (p ,b;b-*b) = riQl^l (i=l,..,m,b e B) , 

11 a o 

where r\ is defined by choosing b 9 and t ,p and the 

other b ^ arbitrarily . (The symbol Dx^ (p,b?b) in the notation of [2] 

corresponds to Dx 1 (p,b?b-b) in our present notation). 
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It follows, by [2,Th. 2.2, p. 644], that either 
the first alternative of Theorem 2.3 is valid or there exist 
a nonvanishing X in E^ and y > 0 such that condition 
(3) of Theorem 2.3 is satisfied, 

(3.6.7) X • Dx ( p , b ; t ,r) * 0 for all t e T and r e R (t ) , 
and 

(3.6.8) X * Dx (p ,b ;j3-b) ^0 for all b e B . 


Relation (1) of Theorem 2.3 now follows from (3.6.7), 
taking account of (3 . 6 . 1 ) , (3 . 6 . 2 ) , ( 3 . 6 . 3 ) , and (3.6.5). 

Similarly, relation (2) follows from (3.6.8), in view of relations 
(3.6.1), (3.6.2), (3.6.4), and (3.6.6). 


It now remains to verify the statement that R (t ) can 
be replaced by R if R is separable and contains all 


constant functions from T to R. In that case we can choose as 




containing p and a set of constant functions 


from T to R whose images form a dense subset of R. Then 


R (t) = R for all t e T and, since r ■> H^(r,x) is continuous 


for all t e T, we conclude that Min 


Min * H, (r , t ) = Min H (r,r). Q] 
r e R (t) r e r 


3.7. Proof of Theorem 2.3 for y- C(T,E^ ). The proof of Theorem 2.3 


for Of = L p (T,p;E n ) partly relied on the observation that 
G(y, 0)^,0^) is the y - integral over T of the function 
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T -*■ g # (• ,T,y (t) ,p (t) ,&•') : T -*■ ^ . For ^=C(T,E n ) this need 
not be (and in most cases of interest is not) the case since 
g^ ( • * t , y (t ) , p (t ) , 0* ) is not assumed continuous for fixed 
x and 0* . We can circumvert this difficulty, however, since 
we are primarily interested in the first m components of y, 

and (y 1 , . . ,y m ) is constant for every solution y of 

0 0 

y = G(y,w ,0 ). This remark motivates the ensuing arguments. 

By Lemma 3.3, the functions G and G exist 

y 

and are continuous on r. It follows, therefore, by the implicit 
function theorem, that the equation 


y = G(y,w*,0*) 


has a unique solution ri(^^9 tf ) in for (aj*,0*) in some 
neighborhood A of (0*,0 tf ) in x r Q and that (w a F 0*) -*■ n(w *, 0* 

A -+ ^ is continuous . 

Let K = G y (y ,0*,0*) , K* = (I-K) _1 -I, and let P m 

be the projection operator (a 1 ,..., a 11 ) = a ■* P • a = (a^,..,a m ): 

E -► E 
n m 

Lemma 3.7.1. The function (to 47 , 0*) ■> P ' (I+K ) • G (y ,w a , 0* ) : 

m 

A -*■ E has a derivative JJ 111 ) at (0*, 0*) , and 


(3. 7. 1.1) (u a ,e°)=(fq*j (x,y (t) ,p(x) ,0 - )w(dT) + / k* 1 (t) ■ 


TxT 


g** (t ,t ,y (t) , p (t) , Q* )y (dt) y (dx) ) • 0 + 
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I J, =1 (g 1 (t k *,y (t kA ) ,p (t k<l ) ,b) + / k^ct)- 
' T 

g(t,t k \y(t k *) f p(t k,l ),b)y(dt))u) k)l U=1 , . . . ,m,a>eE m2 , 6 # e E m2 ) , 

*i 

where k = (k'^,...,k ). 

Proof. We set H (uj a , 0*) = P^(I+K*) -G (y , u> # , 0 *) and verify that 

= / g # 1 (T,y(x) ,p' ( 00 *) (x) ,0*)y (dx) + 

T 

+ f k 1 (t)*g # (t ,t ,y (t) , p' (w^) (t) ,0*) y (dt) y (dx) 


Assumptions (2.1. 1) and (2.1.2) imply that (m^e*) -► H 0 # (to*, 0 # ) : 
A +^(E m 2/ E m ) exists, is continuous, and 


H e *(0*,0*) -0* (O*,0*) , 


where 


J0 


is defined as in 


(3.7. 1.1) . 


We also observe that 


i , D n i 0 0 

H 1 (u ,0 ) - H 1 (0 ,0 ) 


m . _ _ . 

1 / (g 1 (T,y (t) , p kK (t) ,b)-g x (r,y (t) ,p (x) ,b) ) y (< 

k '* =1 " k * 


*i — VP— _ _ 

+ E f y(dx)/ k (t) • (g(t,T,y (t) , p (t) ,b)-g(t,T,y (t) , p (x) ,b) )y (dt) , 
k,t=i T 
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\r fi Ir 0 kfl, * 

where = N mA-m+k (t , to ) . Since t e T for k, 1=1 , . . , m, 

* 0 0 

it follows from the definition of T that H^ttO *0 ) exists and 

V (0-,0Vu)' • (to o'") . 

We conclude that (0°,0 fl ) = QED. 

Lemma 3.7.2 . There exists a constant c such that 

Sup | G(y ,o> # ,e') (t) - y(t)| < c { | w*| + 1 0 *\ ) 
teT 

for all (uj*, 0*) sufficiently close to (0°,0 D ). 

Proof . We have y ( t ) = G(y,0^,0^) (t) (teT) and , for all teT, 

| G (y , u)*, 0 *) (t) - y (t) |<|/ (g # (t,x,y (x) ,p' (O (x) ,0 - g # (t ,x ,y (x ) , p ' (oft ,0 

T v(dT)| 

+ | l f (g* (t ,x,y ( x ) ,p k& (t) ,0* ) - g* <t,x ,y (x) ,p (x) ,0*) )y (dx) | 

=«= a + b , 

k C k5 

where (t ,u> ) . We observe that 

a < f T (x) y (dx) • | 0°| 

# — 

because | g* 0 (t ,x ,y (t) ,r , 6 )| <^{x) everywhere, and that 


i 
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b < 2 Z f (x)y (dx ) 
k , £=1 

ji _ 

because | g # (t , t , y ( t ) , b ) | < ^ (t ) everywhere. The conclusion of 
the lemma now follows directly, remembering that 

1 k Q 

lim -r rrf ^(T)y(dx) = (t ) (k , &=1 , . . . ,m) 

V o * 

since t e T C (ip) (k , 1=1 , . . ,m) . QED 

3.7.3 Completion of the proof . We now observe that, for 
w = (ui°,0*), 0^ = (0*,0*), and for all w e A , 

n (w) - G(y, w ) = G ( n(w) , w) - G (y ,w) = G y ( n(w) ,w) • ( n (w) - y) ; 

hence 

n(w) - y = (I - G y (n (w) ,w) ) _1 (G (y ,w) - G(y,0')), where 
rj(w) e [y, n(w)]d^. Thus 

(3. 7.3.1) P m . ( n (w) - n (O' ) - (I + K* ) * (G (y ,w) - G(y,0'))) 

= P m * ( ( d-G y ( n (w) ,w) ) _1 - (I -Gy (y , 0 ' ) ) _1 ) (G(J,w) 
- G(y,0'))). 

Since w -*■ r\ ( « ) is continuous, and so is, by Lemma 3.3, 

(y,V*) Gy(y,w), it follows from Lemma 3.7.2 that the right 
hand side of (3. 7. 3.1) is o(|w|). Thus, in view of Lemma 3.7.1, 
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the function w -*• P • n(w) has a derivative at 0' and 


( n 1 ' • • » n m > . » " <V r,, w (0 ' ) = V (I+K >* 

(W , 0 ) 

•' G ( U *,e') 'y' 0 "' 0 ") m P' 

where is defined by relation (3.7. 1.1). 

We can now complete the proof of the theorem exactly 
as we did in 3.6 for t L p (T,y;E n ). QED 
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APPLICATIONS OF HAMILTON-JACOBI THEORY 
TO PLANAR TRAJECTORY OPTIMIZATION* 

By S. K. Lakhanpal 
Vanderbilt University 
Nashville , Tennessee 


SUMMARY 


The purpose of this paper is to study the application of Hamilton- 
Jacobi perturbation methods to the determination of the minimum fuel 
trajectory of a rocket moving in a plane under a central gravitational 
force and the thrust of an engine. First, a brief survey is given of 
the needed theorems from the calculus of variations and Hamilton -Jacobi 
theory. The problem is then formulated analytically and the multiplier 
rule and Weierstrass condition applied. The Hamiltonian is separated 
into base and perturbation parts. Two methods are given for obtaining 
a complete integral of the Hamilton-Jacobi partial differential equation 
for the base Hamiltonian. Jacobi's Theorem is applied to give a system 
of canonic constants for the base problem. The procedure for using these 
constants as canonic variables in the perturbing Hamiltonian is then 
developed. 


INTRODUCTION 

Many trajectory optimization problems are of the Mayer type in the 
calculus of variations, the classical theory being easily extended to in- 
clude control variables. (See, for example, Hestenes, Ref. [l or 2]). 

With differential constraints in normal form, the multiplier rule gives 
equations of extremals as canonical equations of a generalized Hamiltonian. 
Jacobi's theorem then gives a method of solution based on finding a com- 
plete integral of a partial differential equation. This theory is 
summarized briefly, without proofs, in the first part of this paper. 

Low thrust rocket trajectory problems are analogous to perturbation 
problems of planetary theory, the thrust of the engine being considered 
as the perturbing force. William E. Miner [ 3 ] has developed this method 
extensively for three dimensional trajectories. The object here is to 
consider the simpler planar case and to study alternative methods of solv- 
ing the partial differential equation of the base Hamiltonian in an effort 
to discover some simplifications. 


*This research was supported by NASA Research Grant NGR-^3-002-015 and was 
done under the direction of M. G. Boyce. A part of it was included in the 
author's master's thesis in mathematics at Vanderbilt University. 
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A planar rocket trajectory problem is formulated with end conditions 
allowing for various missions, including rendezvous with a satellite in a 
coplanar orbit. The base Hamiltonian is taken as the part not involving 
thrust. The partial differential equation for it is linear, and our first 
solution uses Lagrange's method for obtaining a complete integral. Jacobi's 
equations determining original variables in terns of canonic constants are 
used to eliminate the original variables from the perturbing Hamiltonian, 
the canonic constants becoming new generalized coordinates and momenta. 

The canonical equations of the new Hamiltonian are then the differ- 
ential equations of the extremals. 

The second method of solving the partial differential equations for the 
base Hamiltonian is to first transform it by a canonical transformation of 
variables and then use Jacobi's method to find a complete integral. The 
procedure described above is then repeated. 


HAMILTON -JACOBI THEORY 
Mayer Control Problem 

The Mayer problem of calculus of variations involving control variables 
may be expressed in the following form. 

The problem is to find in a class of admissible arcs 

y i(t)> t o - t - 1 = ’*•> n > <5 = 1 f *•*> m > 

satisfying differential equations and end conditions 
y ± = f t (t,y,u), 

J k ( V y(t 0 ) > t i f Y ("^! ) ) = k = 1, •••, p < 2n + 2, 

one which will minimize a function 
J(t Q , y(t Q ), t^, y(t i )) . 

Here, in the arguments of the functions, y denotes the n-vector 

y . • • • * y and u the m-vector u , • • • , u • The super dot denotes 
i 7 n l m 

derivative with respect to t . Partial derivatives will often be denoted 
by subscript variables and summation by the tensor analysis device of re- 
peated indices. In this study admissible arcs will be arcs whose elements 
(t, y, y) lie in a given 2n + 1 dimensional open region R and whose 
control variables u are in an open region U . The end points 
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(t Q , y(t Q ), t , yCt^)) of admissible arcs are required to lie in an open 
set S, and y, y, u are continuous functions of t. The given functions 
f^, J are assumed to have continuous partial derivatives in their 
arguments to as high as second order. 


First Necessary Condition: Multiplier Rule 

The classical first necessary condition can be stated for the Mayer 
problem with control variables in the following form. [4] 

Theorem 1 . An admissible arc E is said to satisfy the multiplier rule if 
there exists a function 

H(t,y,u,\) = i = 1,2, ••• ,n, 

where \'s are functions of t not simultaneously zero and continuous 
along the arc E, such that the equations 

(1) \~ 0 ’ = 


are satisfied, if the end point conditions = 0, k = 1, •••, v, hold, 
and if the transversality matrix 


H(t o ) + -H^) + -X t (t 0 ) + X 1 (t i ) + 


kt 


kt 


J kyi(t 0 ) J ky i (t i ) 


is of rank p. Every minimizing arc must satisfy the multiplier rule. 

Solutions of equations (l) are called extremals , and equations (l) 
are called the canonical equations of extremals . They are the Euler - 
Lagrange equations for the problem, and the function H is analogous to 
the Hamiltonian of mechanics. If, for admissible arcs, y and u are 
assumed only piecewise continuous, then Theorem 1 holds between corners 
of E. 


Weierstrass Condition 


The Weierstrass condition for the Mayer control problem can be 
stated as follows. [4] 
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Theorem 2 . Along the minimizing arc E, the inequality 

H(t,y,x,u) < H(t ,y,\,u) 

must hold at each element (t,y,X,u) of E for every u in U. 

Thus H(t,y,X,u) is a maximum with respect to the control variables 
for a minimizing arc, for which reason this condition is often called the 
Maximum Principle. 


Elimination of Control Variables 


An arc along which the determinant H 


W 


^ 0 is said to be non- 


singular. It will be assumed that all arcs considered are non-singular. 
The equations = 0 can then be solved for the control variables in 

J 

terms of multipliers and state variables, and control variables can be 
eliminated from the Hamiltonian. This will be supposed done, and the 
Hamiltonian will be written as 

H *(t,y,x) = H(t,y,u(t,y,x),\)« 

It follows that the canonical equations of the extremals can be ex- 
pressed in terms of H* . For, if the equations 

H u = °> 0 = 1, • • * ,m, 

<3 

of the set of equations (l) can be solved for u. = u.(t,y,x), then 

H* = H + H u. , 

*1 y i U j Jy i 


= H + H u. 


U j jX i 


Since H =0, it follows that 

U j 
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or 



and 



Hereafter H*(t,y,x) will be denoted by H(t,y,\) because of the equiva- 
lence of the two Hamiltonians. 


The Hamilton -Jacobi Equation 

The partial differential equation of first order 
(2) S t + H(t,y,S y ) = 0, 

is called the Hamilton-Jacobi equation . It has dependent variable S and 
n + 1 independent variables t,y • The complete solution of ( 2 ) 

will have n + 1 arbitrary constants. However, one is additive and is of 
no importance here, so we shall consider a solution with n independent 
constants, no one of which is additive, to be a complete solution. 


Theorem 5 . Let the Hamilton-Jacobi equation ( 2 ) have the solution 
S = S(t ,y , • • • )y n > a )”* * a m ) depending on m (< n) parameters <3t^, • • * ,Q^. 
Then each derivative is a first integral of the canonical Euler 

J 

equations system 


y = H , = -H : 

*1 \ ± ’ i y ± 9 


that is, = constant along an extremal. 
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Jacobi’s Theorem 

Theorem k . Let S(t ,y ^ 9 ■ • • * * * > a n ) be a complete integral of the 

Hamilton -Jacobi equation (2), that is, a solution depending on 
n-parameters an( ^ having the n by n determinant 

Also let be n arbitrary constants. Then the functions 

(3) y ± = i = 

defined by the relations S a = £., together with the functions X. = S , 

i 1 y i 

constitute a general solution of the canonical system 

*i“\ ’ h= - H y. ' 1 = 

For proofs of theorems 3 and k see [5, p. 90]. 


S ^ 0. 

a,y: 


Hamilton -Jacobi Perturbation Theory 


In celestial mechanics the path of a planet is disturbed by the pres- 
ence of other heavenly bodies. This disturbing force is very small compared 
to the attraction of the sun. The Hamiltonian is expressed as a sum of two 
parts; the one which corresponds to the motion of the planet without the 
disturbing influence is called the base Hamiltonian, and the one correspond- 
ing to the distiirbing factor is called the perturbing Hamiltonian. The low 
thrust rocket problems in trajectory analysis can be treated in a similar 
way, the thrust of the engine being considered as the disturbing factor. 


The following theorem shows how to obtain a complete integral of 
order n of the Hamilton -Jacobi equation for the base Hamiltonian in case 
it involves fewer than n s [6, p. 29]- 

Theorem 3 « Let H(t ,y_^, • * • ,y n ,\ , • • • ,X Jl ) be the Hamiltonian for a 

dynamical system. Let H = H (t,y ,"*,y where k < n, be 

001 n x iv. 

the base Hamiltonian and let S*(t,y ,«*«,y ,<2 , ) be a solution of 

1 n 1 k 

the Hamilton -Jacobi equation for H q depending on k independent para- 


meters (0£ , • 




with 


y.a. 

*1 J 


^ 0,i , j = l,2,***,k. Then 
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n 

s = s*(t,y i ,-",y n /V-’ < \) + }- Vi' 

i=k+l 

where (Q^ +1 , • • • ,0^) are independent parameters, is a complete solution of 
order n for the base Hamilton -Jacobi equation. 

From Theorem '4 it follows that 
(4) Pi ~ Sq, , = S , i=l,-«*,n. 

i 

We solve these equations for y's and \'s in terms of Of's and P's, 
thus = y^Q^pjt) and = \^(a,p,t), and substitute these values in 

the perturbing Hamiltonian, say H . Now is expressed in OS's and 

p's as variables. 

On considering S to be a generating function for a canonical trans- 
formation with CK's and p's as new variables, it follows that the new 
Hamiltonian is + H, [5, p. 79] • But 

S + H = S, + H + H , and S + H =0 
t t o i* to 


when S is a complete integral of the Hamilton -Jacobi equation for the 

base solution. Hence the H is the Hamiltonian for the total problem 

l 

in terms of the variables Q£ , p_^ ; and the canonical equations for 
extremals in these coordinates are 


<X = H 
x l Pl 


e. = -h 


The solution of these equations gives the extremals for the problems with 
2n constants of integration [6, p. 27 > 7> P- 137]. By the use of the 
set of equations (4) we can express the trajectory in terms of y's and 
t. This theory can be extended to splitting the Hamiltonian into more 
than two parts. 


Canonical Transformation 

Suppose the variables y's and \'s are transformed to new vari- 
ables q's and p's. If the transformation has the property that for 
every Hamiltonian H(t,y,\) there exists a function K(t,q,p) such that 
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. BK 


P = 


-dK 


1 , . . . ,n, 


then the transformation is canonical. 


M. Let N denote the matrix 


It is assumed that the transformation has a non-vanishing Jacobian 

0 l“ 

q of order 2 n, then the nec- 

_** n 

essary and sufficient condition that the transformation be canonical is 
that m t um = cN, where c is a non-zero real number. [8 ]. 


PLANAR TRAJECTORY OPTIMIZATION PROBLEM 

A rocket moving in a plane under a central gravitational force and 
the thrust of an engine is to achieve a specified mission starting from 
a given initial state. The variable angle of thrust, which is a function 
of time, is the control variable. It is desired to find the equations 
of the path requiring the least amount of fuel. 


Assumptions 

The path of the rocket is assumed to be in a plane, and hence a polar 
coordinate system is used, with origin at the center of the earth. The 
coordinate system is fixed relative to the earth and the gravitational 
force on the rocket is assumed directed towards the origin. The rocket is 
considered as a particle of variable mass. Air resistance is assumed 
negligible and thrust magnitude to be proportional to a constant rate of 
flow of mass. 


Equations of Motion 

Let (r,0) be the polar coordinates of the rocket, Ot the angle 
between the radius vector and the direction of thrust, P the thrust magni* 
tude, m the mass, c the constant rate of mass flow, and k the 
gravitational constant. The equations of motion can then be expressed as [9] 

r - r O 2 =-k/r 2 + (F/m) cos Of, 

( 5 ) r0 + 2r0 = (F/m) sin Of, 

m = -c. 

The theory of the Lagrangian 

L = (r 2 + y 2 ¥)/2 + k/r. 
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for the unit mass two-body problem without thrust, suggests defining 
u = dL/dr, w = dL/d0. 

Thus u = r , w = r 2 0, 

and u is radial velocity, while w is rv, where v is tangential 
velocity. The equations of motion (5) then become 

d = w 2 /r 3 - k/r 2 + (F/m) cos oc , 

(6) d = (r F/m) sin a, 

r ■ u, 6 - w/r , m = -c . 


Let the initial and terminal conditions be denoted by J K = 0, K = 1, • • • ,p 
<12, in the notation of the general problem in the first part of this 
paper. For a minimum fuel trajectory, the function to be minimized can be 
expressed as J = m(t Q ), with m(t^) a given constant. If the initial 

position and velocity are given, we have 


J 5 t 

1 O' 


J 2 S u(t o ) - V J 3 = v(t„) 


J = r(t ) - r 
4 o' o 


6(t o ) - e o 


For terminal values, J = m(t ) - m , and the remaining J’s would be 
6 11 


functions of t , u(t^), w^), r(t^, m(0. 


Elimination of Control Variable by Weierstrass Condition 
The Hamiltonian for equations (6) is 
H = \ (w 2 / r 3 - k/r 2 + (F/m) cos Of) + x (rF/m) sin a + Xu + X w/r 2 - c\ , 

1 2 3 4 o 

where the X's are functions of t not simultaneously zero. 

From the Weierstrass condition, H, as a function of a, must be a 
maximum. Hence H^ = -X^CF/m) sin Of + X^rCF/m) cos Of - 0 and 

H oa = cos a -X 2 r(F/m) sin a < 0 . 

It follows that 

tan a = rx /X ,sin a = rx /nT (x 2 + r 2 X 2 ) , cos a = \ /sT\ (x 2 + r 2 X 2 ) 

2 1 212 112 

the radicals being positive because of H^ < 0 . 

Elimination of a gives H = H q + H , where 
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H q = X (v 2 / 1 ® - k/r 2 ) + w/r 2 - cX s and = (F/m) 'f (x 2 + r 2 X 2 ) . 

The Hamilton -Ja eobi equation for the base Hamiltonian H q then is 

(7) dS/dt + (w 2 /r 3 - k/r 2 ) dS/du + udS/dr + (w/r 2 ) dS /d0 - cdS/dm = 0. 


Detei*mination of a Complete Integral 

In seeking a complete integral, apply separation of variables, letting 

S = S (t) + S (0) + S (m) + S (u,r) . 

12 3 4 

The Hamilton-Jacobi equation assumes the form 

dS /dt + (w 2 /r 3 - k/r 2 ) SS /du + u£S /dr - (w/r 2 ) dS /d0 - cdS /dm = 0 , 

1 4 4 2 3 

which does not involve t, 0 and m explicitly. Hence 

dS /dt = OL , dS /d0 = a , dS /dm = a 
r 12 2 3 3 

where a , a , a are arbitrary constants. 

12 3 

The Hamilton-Jacobi equation can now be written as 

(8) (w 2 /r 3 - k/r 2 ) dS^/du + udS^/dr = cQ^ ~ a 1 ’ a 2 v /r 2 > 

which is in the form of Lagrange's linear equation 0.0, Ch.XII], and its 
subsidiary equations are 


w 2 /r 3 - k/r 2 u c a - a - a w/r 2 

11 3 12 

From the first subsidiary equation we get 

(9) u 2 - 2k/r + w 2 /r 2 ~ -a 2 . 


which we write as 

f = , 

where -a 2 is a constant of integration with the sign chosen so as to give 
a periodic trajectory. 

On substituting from the above for u in the last subsidiary equation, 
we have 

dS = » cQ 3 - «i) r 2 - Qtg w) dr 
4 r n/- a 2 r 2 + 2kr - w 2 
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the ambiguous sign of the radical being absorbed in the arbitrary constants 

a a , a . Integration now gives 
12 3 


S = 

4 




, ^ 2 , Ak . -i s 2 r 

r + 2kr - w + — sm 

3 


2 2 


_= - cc sin 

■/k 2 - a 2 * 2 2 


-i kr - w 

Wk 2 - a ^ 2 

+ b, 


where b is a constant of integration and A = cC^ - a . 

On eliminating a by use of (9) and introducing the abbreviating notations 
X 2 = -w 2 + 2kr - u 2 r 2 , 

Y 2 = (kr - w 2 ) 2 + u 2 ^ 2 , 

Z = kr - w 2 , 

we can express S in the form 

4 


s 

4 



X 2 -kr 


) - °L 


z 

Y 


+ 


or S — g + b. 

4 

The general solution of (8) will then be 
cp(f, S - g) = 0 

4 

where cp is an arbitrary differentiable function. It follows that 

S = g+ Q£f+a , 

4 4 6 

where C t and 0t Q are arbitrary constants, is a solution and may be taken 
as a complete integral of (8). By adding S , S , S q , S 4 we now obtain 

an integral of equation (7). As explained in the general discussion of 
Hamilton -Jacobi theory, the additive constant ct may be dropped. Also, 

by Theorem 5, the term aw can be added to give, finally, as a complete 

integral of the Hamilton-Jacobi equation (7) for the base Hamiltonian 

H , the following 
o 
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S = at+ae+Qfm + — (-U + I sin' 1 — ~ — ) - a sin' 1 % % — + a v. 

123 ^2 A i 2 1 T 5 

The Remaining Canonic Constants 

By Theorem 4 (Jacobi’s Theorem), if S is a complete integral of the 
Hamilton -Jacobi equation, then there are constants (3^, 3 s such that 

dS/dC^ = 3^ • 0x1 espying out the differentiations on the above S, we 

get 


3 , r / , k . -l X 2 - kr\ 

i " t “ (~u + sln Y ' 


-i Z 

3 = 0 - sin ^ > 

2 


. cr 3 / , k . -i X 2 - kr A 

3 = m + (-u + 7 sin — ~y — V = 


- c(3 i - t) 


P = -X 2 /r 2 , 

4 


3 = w . 

5 


The Multipliers 


Also by Jacobi's theorem the \'s are equal to the partial deriva- 
tives of S with respect to u, w, r, 9, m ; and the equations so 
obtained together with the above equations determine a ten-parameter 
family of solutions of the canonical equations for the base Hamiltonian 
H q . On letting B denote 


we find, after some simplification, the following results: 

X = [2rf3 - P 2 /r + (rp + k) kp 2 /rB] A/p 2 + (2a - 3A(t - P )/p )u 

145 5 5 44 14 

+ (kr - p 2 )a p /rB, 

5 2 5 
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X = [ur - 3P (t - P )] Ap _/r 2 p 2 + 2P- p /r 2 + a + [Akp ( r p - k) 

2 4 1 5 4 4 5 5 54 

+ a p 2 (rk + p 2 )'] u/rP 2 B, 

2 4 5 4 



The multiplier X can be computed in the same way as X and x > 

3 12 

but it is not needed for H 

l 

The above computed X and \ are expressed as functions of Of* s , 

1 2 

p’s, u, and r* The variable u can be eliminated, since from p = -X 2 /r 2 

4 

and X 2 = - p 2 + 2kr - u 2 r 2 we get 

5 

u 2 r 2 = p r 2 + 2kr - p 2 . 

4 5 

Also we have, from the £ equation, 

3 

m = (3 - c(t - {3 ) . 

3 1 


The H Hamiltonian 

l 

The perturbing Hamiltonian H = (F/m) vX 2 + r 2 X 2 can now be ex- 

1 1 2 

pressed as a function of Ct' s, p's, r, and t, and r is a function of 
p’s and t by means of the equation for 3 . As explained in the first 

part of this paper, H is now the Hamiltonian for the total problem in 
variables 0^, p , t. Consequently, the canonical equations for the 
extremals in these variables are 

\ = dH i /ap 1 , p ± = - aHyaaj . 

More explicitly, letting \ = + X 2 , 

12 
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a = 


cF\ 


1 (p - e(t - p )) 2 

3 1 


f(x ax ,/ap + r 2 x ax /ap + rx 2 ar/ap ) 

111 221 2 1 

x(p - c(t - P )) 


- Fx 


(P - c (t - p )) s 


f(x ax /ap + r 2 x ax /ap + rx 2 ar/ap ) 

113 2 2 3 2 3 

X(0 - c(t - p )) 


. f(x ax /ap, + r 2 x ax /ap, + rx 2 9r/ap.) 

rv 111 22 1 2 1 . _ 0 i, c- 

X(P - c(t - P )) , 1 - 2, 4, 5 


cc = 
1 


- p(x ax /aa. + r 2 x ax /aa.) 

« T(p„ -~c(t~- ~ P i )f ^ 5. 


Since \ and \ are linear in the CC' s, the differentiations in the 
1 2 

right members of the p equations are easily carried out. However, this 
is not possible for the & equations. 

The solution of the above system of differential equations gives the 
optimal trajectories of the rocket in terms of CU's, p's, t and ten 
constants of integration. Closed form solutions do not seem possible, so 
approximation methods by some type of iteration on r seem necessary. 


A SECOND METHOD FOR THE PLANAR PROBLEM 

This method involves a canonical transformation of variables and 
leads to a complete integral of the base Hamiltonian. As before, the 
perturbing Hamiltonian, with the canonic constants as new variables, be- 
comes the Hamiltonian for the total problem. The resulting canonical 
differential equations of extremals are somewhat different from those of 
our first method, but they again involve similar inherent difficulties 
and do not lead to closed form solutions. 


The Canonical Transformation 

Let the following transformation be made, where the q's denote 
the generalized coordinates and the p’s the generalized momenta. 
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This transformation is easily verified to satisfy the necessary and suffi- 
cent condition for a canonical transformation as given in the first part 
of this paper. 

The other transformations consisting of interchange of coordinates 
and momenta have been investigated. Changing r to a momentum variable 
greatly complicates the Hamilton -Jacobi equation. Changing 9 has little 
effect on either the base or the total Hamiltonian. Changing u only, 
or u and w , or u, w, and m to momenta give essentially the same S 
for the base Hamiltonian as does the above transformation. 


A Complete Integral of the Base Hamiltonian 
In the new variables, H q assumes the following form: 

H o = - k/q f } - P 1 P 3 + p A /q l ■ % • 

Hence the Hamilton -Jacobi equation is 

(10) S t + q i (q^/q^ - k/q|) _ + p^/q^ - cq^. = 0, 


where S^, p_^, p g , represent dS/dt, dS/dq^, 3S/dq^, and BS/dq , 

respectively. A solution of the above partial differential equation can 
be obtained by Jacobi's method [11]. 


dS dp 
t _ *4 

0 “ 0 


d P, 


« 
2 3 


k/qf 


% 

-P. 
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The first two terms give S - dS/dt = oc f p - dS/dq = Q : , and the last 

t 14 4 4 

two give 

p dp = (qjW* - k/q 2 )dq . 

1 1 2 3 3 3 

On integrating this we get 

pf = (-Off qf-f- gkq - qf)/qf 

1 3 3 3 2 3 

or p = W/q where W = + »J -Q 2 q 2 + 2kq - q 2 

13 “ 3 3 3 2 

The constant of integration, -0^ , has been chosen negative to give a 

3 

periodic solution. 

When the values for S , p , p are substituted in the equation (10) 

t 1 4 

and the result solved for p , we get 

3 



+ a sin 

— 4 


kq - q 2 
3 2 

q \/k 2 - a 2 

3 3 2 


The solution of the partial differential equation (10) is obtained 
from the exact differential 

dS = (dS/dt)dt + (dS/dq )dq + (SS/^q )dq + (dS/dq )dq 

1 1 3 3 4 4 

However, q and q need to be included as independent variables in 

2 5 

addition to t, q , q , and q ; so, by use of Theorem together with 

13 4 
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the above results, we get 


f q. Gt - cq \ 
= at+aq + [r 1 - — £ ) 

i -*4 \<l a 2 ) 


w + 


k (a - cq s ) 
a 3 


sin 


a 2 q - k 
-l 3 3 


3 2 


+ OL sin” 1 ^3 ” q 2 
- 4 


q -7 k 2 - c^q 2 


+ a q + a q 

2 2 5 5 


as a complete integral of (10) involving five parametric constants, the 
additive constant being ignored. Note that the term q^W/ q^ occurs 

twice in the integrations but is counted only once in S. 


The Canonical Constants p and Momenta p.^ 

By Jacobi's Theorem, dS/dC^ = p , with arbitrary & ± . Let 


C = a - cq , D = \Tk 2 - Q^q 2 . 
i 5 3 2 


Then 


p « t - w/a 2 + (k/ct 3 ) sin" 1 (a 2 q - k)/D, 

1 3 “ 3 3 3 

K = % ’ 

2 2 

B = 3C(t - 6 )/a - CW/a? - Of q q /W + q 2 C/a W + kq C/aPw 

3 1 3 3 313 33 33 


(k 2 C + a* a q )(kq - q 2 )/a?WD 2 
3 4 2 3 2 3 


P = q + sin“ 1 (kq - q 2 )/q D , 

4 4 — 3 2 3 


p 5 = q s • 

Where ambiguous signs occur above, the top sign is to be taken if W is 
chosen positive, otherwise the lower sign. 
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Since = dS/dq^ , it follows that 



p = a + [c/a 2 - a /q + (kC/c^D 2 + a k/q D 2 ) (o^q - k) 


- q. /q o ](q /w), 

13 2 

p = q. C/W + (a q - kq )/q W + q q^/qfW , p = a . > 

3 3 42 13 123 44 

p = a + cW/a 2 + (kc/or 3 ) sin -1 (cr’q - k)/D = a + c(t - p ) . 


The Hamiltonian H 

l 

Application of the canonical transformation to the original H gives 
H = (-F/p ) Vq 2 + p 2 q 2 . 

1 5 12 3 

To express H in terms of o^'s, P's, t and q , we find from p = q . 

1 3 2 2 

P = q , and the p equation that 

5 5 3 

q = [3C(t - P ) - CC p - CW/a 2 + q 2 C/w](W/a^q)+ [kq CD 2 + (k 2 C + C^a B ) 

1 133 33 333 342 

(kq - P 2 ) ]/Q^q D 2 , 

3 2 3 3 

where now C = a - cp , D = \Tk 2 - Q^p 2 ,W ■ + •J -p 2 + 2kq - O^q 2 . 

1 5 32 “ 2 333 

This value of q substituted in the formula for p above gives 
1 2 

p = a - a/w + p cw/a^q 2 - kp c/a^q w + [a p p - 3p c(t - p 

22 4 233 233 323 2 1 

+ [°^kq 2 (P C + Q^a )(a 2 q - k) + p (k 2 C + a*a p ) (kq - ^ 2 )]/a 4 q 2 D 2 W . 

332 3433 2 342 3 2 33 

By using the expressions for p , q , and p as above, we can 

2 1 5 




2 

3 “3 
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reduce H to a function of OS's, 3’s, q , and t. By the equation for 
1 3 

3 , q is an implicit function of a's, 3*s, and t. Hence becomes 

a function of Q!'s, p's, and t, and is then the Hamiltonian for the 
total problem. The canonical equations of extremals giving the optimal 
trajectory can then be obtained as in the first method. The analysis again 
is very involved and does not lead to closed form solutions, so we do not 
proceed further here. For another treatment of this problem one should 
refer to W. F. Powers [12], The search for canonical transformations 
which will give simpler forms for at and should be continued. 
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Consider a dynamical system whose equations of motion are 


Pi 


8fl (qj :Pj 

3Pi 

3 qi 


i=l ,2 , . . . ,n ; j = l ,2 , . . . ,n 


CD 


where the Hamiltonian, is understood to be a function of the gen- 

eralized coordinates, qj , and their conjugate momenta, pj, j=l,2,...,n, and 
possibly the time, t. If one-half of the integrals ofEqs (1) have been 
obtained in a suitable form, there is a well-known theorem, due to Liouville,* 
which may be used to find the remaining integrals. The purpose of this note 
is to point up the related, but perhaps not so well-known fact that a method 
of obtaining a complete integral of the Hamilton -Jacobi partial differential 
equation associated with (1) is implicitly contained in the theorem. Since a 
complete integral of (1) will permit us to express the solution of (1) in 
terms of canonical constants of integration, recognition of this fact is of 
importance in studying perturbations of the original system. The method will 
be discussed and applied in what follows. 


Suppose that n integrals of a dynamical system with 2n degrees of free- 
dom are known in the form 

$i(qj;Pj;t) = ct*, i*l,2,...,n; j = l,2,...,n (2) 

where the form a set of n independent constants of integration. If the 
Poisson bracket expression, vanishes for each i and j and if the 

are solvable for the p^ in the form 


*E. T. Whittaker, A Treatise on the Analytical 
Rigid Bodies (New York: Cambridge University Press, 


Dynamics of Particles and 
1959), pp. 323-325. 
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Pi = fi (qj ;t) , i=l,2,...,n; j-l,2,...,n 

the Liouville theorem states that the difference between 


n 



and # (qj ;cij ;t) dt is the perfect differential of a function w(qj;aj;t) and 
that the remaining n integrals of the system are given by 


9W 

3 


i=l , 2 , 


(4) 


where the 6^ form a set of n constants of integration which are independent 
of each other and of the set formed by the . 


To say that 


2 f i d qi - (q^a.jtjdt, j=l,2,...,n 
i = l J J 

is the perfect differential of a function w(qj;aj;t) means that 


(5) 


aw 

aq. 

aw _ 
at 



i = l ,2 , 


( 6 ) 

(7) 


Thus, implicit in the Liouville theorem is the fact that the function W is a 
complete integral of (7) which is the Hamilton -Jacobi partial differential 
equation associated with the system. 


When the n integrals of (2) can be solved for the q^ instead of the p^ , 
i=l,2,...,n, the theorem may also be applied, if the canonical transformation 

Qi = Pi 

p i = -<ii 

to new variables (Qi>Pi) is first introduced. Even if we are not able to 
solve the n integrals (2) explicitly for the p^, or for the q^ , a complete 
integral may still be obtained in certain important cases now to be discussed. 

Suppose we are able to solve the integrals (2) explicitly for £(£ < n) 
momenta and n-£ coordinates. Suppose further that, after reordering the sub- 
scripts, the expressions for the £ momenta and n-£ coordinates can be written 


V (8) 
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in the restricted form 


Pi fak »Pm> a j »*) » 

i-1 ,2, . . . ; 

m>£; j=l,2,. 

k<l; 
. . ,n 


1i = h i( c lmiPk;«ji t )> 

i=A+l ,&+2, . . 
m<£; j = l ,2 , . 

. ,n; k>£; ! 

J 

► (9) 

By introducing the canonical 

transformation 



Pi* = Pi ^i* = 

p.* = -q. q.* = p.. 

i=l,2 , . . . ,£, 
i=Ul,A+2,.. 

1 

J 

| (10) 

(9) may be written in the form 




Pi* = 

i=l ,2 , . . . ,n ; 

j-1 »2 , . . . ,n 

(11) 


Equations (11) are in the form of (3), and the theorem maybe applied. 


Example 1 : Central Orbit in the Plane, Polar Coordinates 

For a particle moving in a plane under a central force derivable from 
the potential V(r), the Hamiltonian function is a constant . If we desig- 
nate by (p r ,p 0 ), the momenta conjugate to the polar coordinates (r, 0 ), respec- 
tively, (see Figure 1), the system has the well-known integrals 


Pe 


a 2 , a constant 



+ v(r) ] 



( 12 ) 

r in 


From (5) , we write 


dW = p r dr + p e d 6 - ajdt 


(14) 


If r Q is chosen so that no new independent constant is introduced, the func- 
tion 



(15) 


obtained by integrating (14), satisfies (7). Also, W is a complete integral 
of (7) since it contains two non-additive independent constants 04 and a 2 . 
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y 



Figure 1 


Example 2 : Free Motion of a Triaxial Rigid Body 

For the free rotations of a triaxial, rigid body about a fixed point 0, 
the Hamiltonian function, which is a constant of the motion, oij , may be writ- 
ten in terms of the Euler angles which specify the position of prin- 

cipal axes at 0 relative to space- fixed axes 0£nC and their conjugate momenta 
(Pe>P<j>*Pip • See Figure 2. 


C 



Figure 2 
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Three known integrals for this dynamical system are 2 




a 3 , a constant 


(16) 



where A, B, and C are the principal moments of inertia at 0 and a 2 *- s the 
constant magnitude of the angular momentum about 0. 

Although it is not possible to solve (17) and (18) so that p^ and pg are 
expressed in the form of (3), the set of equations (16), (17), and (18) is of 
the form of (9); hence, the canonical transformation 


Pi = ‘ 

P 2 = “ 9 > <*2 = 

P 3 = P^* <* 3 = 

allows us to write (16), (17), and (18) in the form of (11). Then, from (5), 
we write 

dW = pidqj + p 2 dq2 + P3 d 93 “ a l dt ( 20 ) 


p e 




If qiO and q 20 are chosen in a manner which introduces no new independent 
constants , the function 


2 See Whittaker, p. 325. 
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obtained by integrating (20), is a complete integral of (7). 
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AN OFFSET VECTOR ITERATION METHOD FOR SOLVING TWO -POINT 
BOUNDARY-VALUE PROBLEMS 

By C. F. Price* 


An offset vector iteration technique is proposed for solving two-point boundary-value problems. 
In this paper the properties of the method are explored. Application to parameter selection is 
first considered and convergence properties are described; comparison is made with other 
numerical methods. The two-point boundary-value problem is shown to be equivalent to the 
parameter selection problem. The method generally has a lower convergence rate than second 
order techniques; however, in many applications each iteration requires relatively few computa- 
tional operations. Therefore it is competitive with higher order numerical procedures in 
applications that require few iterations to obtain an acceptably accurate solution. A modification 
to the offset vector method is suggested which takes advantage of the finite difference information 
generated at each iteration. 

(First received September 1967 and in revised form February 1968) 


1. Introduction 

The use of offset vectors to develop iterative techniques 
for solving two-point boundary-value problems is a 
numerical procedure that has been proposed and 
investigated for use in near-earth (Godal, 1961), (Price 
and Boylan, 1964) and interplanetary guidance applica- 
tions (Baffin, 1964a), (Slater, 1966), The advantage of 
the method, when it can be applied, is that each 
iteration is often computationally simple to mechanise, 
relative to other techniques. In fact, there is evidence 
that it converges sufficiently rapidly in some cases to 
permit its use in real-time airborne guidance systems 
(Price et al 1964). This study was motivated by the 
desire to utilise an offset vector method for solving 
certain two-point boundary-value problems that repre- 
sent necessary conditions for optimal trajectories. An 
example of such an application is presented in a recent 
paper (Price, 1967). 

The concept of the offset vector method is easily 
understood and motivated through a simple, familiar 
example. Consider the problem of hitting a target with 
a projectile fired from a gun that is stationary with respect 
to the target. Let the direction of the gun barrel on the 
yth shot be designated by a unit vector, i Jf j = 1, 2, . . ., 
expressed in an appropriate coordinate system. On the 
first shot, y=l, i*i is some function, 

h — h( r r)» 

of the target’s position, r T . Suppose the first shot 
misses the target by a miss-vector, Arj, such that an 
impact point, r u is defined by 

r, = r r -f Ar,. 

Using whatever quantitative knowledge of the miss he 
has, the gunner attempts to make an intelligent choice 
of the pointing direction on the next shot. If it happens 


that i 2 is expressed in the functional form (however 
crude) 

h = h ( r t ~ Ar,) 

where (r r — A#^) is a ‘dummy’ target position, we say 
that an offset vector iteration technique is being used. 
By analogy, on the £th iteration 

*k + 1 = *k+ i( r r ~ Ar, - Ar 2 - . . . -Ar fc ); k =* 1, 2... 

The philosophy is that on each iteration the aiming point 
is changed by the negative of the miss-vector. It is 
shown in this paper that such an approach is applicable 
to solving two-point boundary- value problems; in fact 
the above example can be formulated as such a problem. 

Offset vector methods are ad hoc in nature because no 
general quantitative prescription is given for implement- 
ing the iterations. In the projectile example, the 
functional form of i k+ ,( ) depends upon the sophisti- 
cation of the fire control system. This point is 
emphasised in the subsequent discussion. However, it 
appears that the convergence properties of the technique 
can be described, to some extent, without reference to 
any special application, and comparisons can be made 
with other numerical procedures. That is the primary 
purpose of this paper. 

In the next three Sections the concept of offset vectors 
for solving parameter selection problems is more 
precisely defined, convergence properties are described, 
and a simple example is presented. In Sections 5 and 6 
it is shown that the two-point boundary-value problem 
reduces to that of parameter selection and results of 
utilising the method in a typical physical application are 
given. In Section 7 a modification to the offset vector 
method is suggested which takes advantage of the finite- 
difference information generated at each iteration. This 
provides a means for making a transition from the offset 
vector method to a finite-difference version of the 


* Staff Member, Experimental Astronomy Laboratory , Massachusetts Institute of Technology, Cambridge, Massachusetts. 
This research has been sponsored by NASA ERC Contract No. NGR 22-009-207. 
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Newton-Raphson technique in situations where many 
iterations are required. 

2. An offset vector method for solving the parameter 
selection problem 

Parameter selection or equation solving is simply the 
task of finding a value of an rc-dimensional vector 
x = Jr® which satisfies the vector equation 

g(x) = 0. (1) 




In parameter optimisation problems, equations of this 
form are necessary conditions that a function <f>(x ) have 
a stationary point. We assume that g(x) also has 
dimension ti and that at least one solution of eqn. (1) 
exists. 

Numerical techniques for solving eqn. (1) depend 
upon having an initial guess x 0 that is ‘near’ the desired 
solution Xoo and improving that guess by iteratively 
generating a sequence {x 0 , jc,, . . .} which converges to 
x m . Criteria for convergence of the sequence are 
usually given in terms of sufficient conditions satisfied by 
g(x) in a region about containing x 0 . 

The most important property of any particular 
numerical method is the total time required to achieve a 
sufficiently accurate solution for x This is dependent 
upon two factors — the number m of iterations required 
to obtain a value x m that is sufficiently close to Xoo, and 
the computational complexity of each iteration. One 
often observes that these factors are inversely related; 
that is, the simpler each iteration is to perform, the more 
iterations required to obtain a desired level of accuracy 
in the solution. This characteristic is evidence of the 
fact that the amount of progress made in each iteration 
toward i.e. the convergence rate, depends upon the 
amount of information used about g(x) in deriving the 
recursion expressions. 

, Because the total time required for convergence is 
often dependent upon inversely related factors, it is 
difficult to state a priori in any particular application 
which of the various numerical methods is most 
advantageous from a computational point of view. 
However, if any initial guess x 0 is quite close to x m , 
relatively simple iteration techniques may accomplish 
the required degree of accuracy with no more, or few 
more, iterations than more elaborate methods. This 
rationale provides the motivation for describing an offset 
vector iteration technique which is potentially simple to 
implement and is based upon the idea of having a 
reasonably accurate initial guess x 0 ; in fact, the structure 
of the method is defined by the manner in which x 0 is 
chosen. 

Suppose one can find an n-dimensional vector function 
g(x) that approximates g(x) such that the solution 
x s x 0 of 

g O) — o (2) 

is relatively easily determined.* For example, g(x) and 



Fig. 1. Graphical development of the first two iterations of 
the offset vector method applied to a scalar function g(x) 


g(x) may be of the form 

£(*) = g -h Gx + cf(x) - <n 
g(x) = g + Gx ” 0 / 

with c a constant scalar, g a constant vector, G a non- 
singular matrix and /(x) some nonlinear function of x. 
If the term e/(x) is small relative to g(x) for x near x«, 
the solution x 0 = — G~'g, is near x Let us write the 
solution to eqn. (2) as 


x 0 =g-m=m w 

where g _l ( ) represents the required inversion of g( )» 
and the argument 0 refers to the value of the rjgltf-hand 
side of eqn. (2). The situation is illustrated graphically 
in Fig. la for n = 1 . 

Having x Q , we can evaluate 


g(x o) = £o> (5) 

noting the eqn. (1) is in general not satisfied, that is, 
g 0 # 0. Based upon this observation an improvement 
to x 0 can be determined by the following reasoning. 
Suppose g(x) differs from g(x) by only a constant vector 
/ 0 , that is, 

g (x) — g (x) + /o ; for all x. (6) 

Then 


g(*o) = “/o = go- 


lf this be true, the solution to eqn. (1) is also the solution 
to 


g(*)-/o = 0 


or 

g(x) = -g 0 - ( 7 ) 


* This is not to say that *o need be determined by an explicit 
formula; the solution to' eqn. (2) may also have to be obtained 
numerically. An example of this kind is given in Section 6. 


Thus we offset the approximating function by the 
negative of the error determined in eqn. (5) and calculate 
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Xi from eqn. (7), using the notation of eqn. (4). 


(y.-- 1 — Si) from eqn. (14) yields 


= K-So)' 


( 8 ) 


Vi = -Ag[/j(y f _,)]. 


(18) 


This sequence of operations is illustrated in Fig. lb. 
The quantity — g 0 is analogous to — Arj in the projectile 
problem of the previous section. 

In general, x x does not satisfy eqn. (6) either, as 
evidenced by 

S(*i) = gi^0. 


Accordingly, replace eqn. (6) by the conjecture 


which leads to 


resulting in 


g(x) = g(x) - go 

(9) 

g(x t ) = ~/l = gt 


g(x) = —go - g, 

(10) 

*2 — K-go - gt )• 

01) 


These steps are shown in Fig. lc. 

The recursion relationships required for the con- 
tinuation of this method are readily inferred from the 
preceding discussion. Define 


gj — g(Xj) 

y i =-I t gjl /=— 1 , 0 , 1 ,... 

./--t 

y - 1 = —g- 1 = 0 

and let 

£(*«) = y,-i; * = o,i,... 

Then, 

Yi “ Yi- l - <?.•; < = 0,1,... \ 
Xi=g~ '(y/- 1) = h(yi_ i) / 


( 12 ) 


(13) 

(14) 


At each iteration one evaluation each of g( ) and h( ) 
is required. The quantity y, is referred to as the offset 
vector. Now we shall discuss circumstances in which 
the sequence {x 0 , x x , . . .} generated by eqns. (12-14) 
converges to x^. 


3. Convergence properties 

One expects that the convergence properties of the 
offset vector method depend upon the accuracy with 
which g(x) approximates g(x). To pursue this reasoning 
define an error vector Ag(x) by 

g(x) =§(x) + Ag(x). (15) 


Equation (18) is equivalent to eqn. (14) and is the 
recursion for solving 

y = -Ag[A(y)]. (19) 

by successive approximations. The solution, y TO , to 
eqn. (19) is the limit of the sequence of offset vectors 
{y 0 , y i, - - •}. Viewed another way, it is the value of 
g(Xco). (See Fig. 1.) 

Sufficient conditions for the convergence of the 
sequence {y,} are known for successive approximation 
iteration methods. For example, convergence is 
assured (Todd, 1962) if A g[h( )] satisfies the Lipschitz 
condition 

max j Ag [A(yO] — Ag[/2(y")]| < k ma x[y' — y" |; 

0 < A: < 1 (20) 

for all y' and y" in a neighbourhood of y ro containing 

Y- i = 0- 

Alternatively, a recursion relationship for x, can be 
derived from eqn. (14). Substituting for y i _ l and y,_ 2 
from respectively eqns. (14) and (13), we have 

*i = h[-\g(x i _ l )]. (21) 

The solution of this expression with x ; and x,_ i replaced 
by x is the value of x — x* that renders g(xj) — 0 and 
g(Xc o) = y«- 

A third way of viewing the iterative procedure is that 
the sequence (g 0 , gj, . . .} of evaluations of g(x*) is being 
driven to a limit of zero. This is perhaps the most 
natural point of view for the applications to be con- 
sidered subsequently. From eqns. (12)— (14) it is evident 
that g(x ( ) is a nonlinear function of all g{xj),j < i, of the 
form 


Si = d*(0 — So - tfi “ ■ • • Si- i)I- (22) 
Similarly, 

ft+ 1 = 0 “ So - gt - • • • Si)l (23) 

Linearising g i + 1 about g t with substitution from eqns. 
( 1 2)— ( 1 4) we have 

g l+l ^g i - G( Xi )H(y Os,; i = 0, 1, . . . (24) 

where 


G( * )s i£ ): H(r) 


_ Wy) 

t)y 


(25) 


Substituting x, for x, we have 

g(*i) = g(xi) + Ag(x f ). (16) 

Into eqn. (16) we can substitute for g(x f ) and x, from 
eqns. (13) and (14), producing 

.YI Z'17'l 

oi / i— i i \- • / 

Rearranging terms and substituting for the quantity 


Equation (24) indicates that 

lim g, = 0 

/-* oc 

if ||7 — GH\\ < 1 (26) 

in some sufficiently small region about x* such that the 
linearisation is valid. Note that if g( ) = g( ), GH = I. 
These convergence properties provide a comparison 
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between the offset vector method and other procedures 
that can be employed for finding Considering 

eqn. (19), perhaps the most significant observation is 
that the method does not possess second-order conver- 
gence because the gradient matrix corresponding to 
eqn. (19), 

frAg[/i(y)] | _^ 0 
koo 

n general (Todd, 1962). Thus a Newton-Raphson 
technique, beginning at x 0 may require fewer iterations 
to approach within a desired accuracy. However, 
the offset vector method possesses two advantages that 
motivate its use in certain situations. 

First, applications arise in which g(x) cannot be 
expressed in closed form, such as the solutions of many 
two-point boundary value problems. In these cases 
every evaluation of g(x) requires numerical integration 
of differential equations. In addition, for Newton- 
Raphson-type procedures the gradient matrix must also 
be computed numerically, requiring additional complete 
integrations of the appropriate differential equation ; for 
each iteration. Hence, if the approximation g(x) is 
sufficiently accurate, one may conceivably reach a point 
sufficiently close to with an offset vector technique 
before a higher order method gets started. The offset 
vector method has proved sufficiently rapid in situations 
of this kind to be incorporated in a real-time airborne 
guidance system (Price et al., 1964). An example of 
such an application is included in Section 6. 

Second, the offset vector method is a reasonable 
starting procedure for a higher order method in situa- 
tions where many iterations are required;, The points 
jc 0 , Xj, ... and associated values g 0 ,g i, . . . can be 
stored to provide corrections, based on finite differences, 
to subsequent evaluations of x { . A possible method for 
accomplishing this is described in Section 7. 

There is the disadvantage that some means must exist 
for finding an appropriate g(x). Whether this can be 
done depends upon the particular problem and the 
analyst’s ingenuity; for this reason the concept of offset 
vectors does not provide a ready-made numerical 
algorithm for attacking all parameter selection problems. 
The fact that applications are known (see the references 
mentioned in Section 1 and the example of Section 6) 
where the method can be applied is a testimonial to its 
usefulness. 


4. Example 1 

To illustrate the offset vector method, a simple one- 
dimensional example is presented using equation 
numbers corresponding to those expressions in preceding 
sections which are exemplified. 

Given 

*(*) = !+* + = 0. (1) 

Let 

i(x) = 1 + *. (2) 


Then 

Yi = Yt-i ~ Si 

Xs = h( Yi - 1) = Yi-i “ O 4 ) 

Using the criterion for convergence provided by eqn. (26), 
we find that 


G(x) = 1 + 3cx 2 ; H{y) = 1 (25) 

3x 2 |e| < 1. (26) 

Furthermore, from eqn. t-4) 


Si + 1 
g< 


& 3* 2 H 


(24) 


which provides a measure of the convergence rate. 

It should be emphasised again that the offset vector 
method is not promoted especially for a high conver- 
gence rate. In general, and for this example in par- 
ticular, it converges more slowly than Newton’s method. 
The main advantage is the relative simplicity with which 
each iteration can be performed. This is illustrated by 
observing that the recursion relationships in eqn. (14) 
for this example require two subtractions and one 
evaluation of g(x) per iteration. On the other hand, 
Newton’s formula. 


*i+i 


• gw g l ,> 


dg(x) [ 
dx U-*« 


requires one subtraction, one division, one evaluation of 
g(x), and one evaluation of dg(x)jdx per iteration; 
clearly this entails significantly more computation. The 
total time required to obtain an acceptably accurate 
solution for x* is less for the offset vector method if |e| 
is sufficiently small so that only one iteration of either 
method is required. 

In situations where g(x) has several dimensions and a 
complicated functional form, the computational advan- 
tages offered by an offset vector method are more signi- 
ficant. As mentioned previously, it is competitive with 
higher order techniques when a sufficiently good approxi- 
mate solution can be obtained. In applications where 
the problem must be solved repeatedly, as in rocket 
guidance systems, considerable computational saving 
may be gained. This is illustrated by the example in 
Section 6. 


5. The two-point boundary-value problem 

The use of offset vectors to develop iterative techniques 
for solving two-point boundary- value problems is a 
numerical procedure that has been applied to near-earth 
(Godal, 1961), (Price et al ., 1964) and interplanetary 
guidance (Baffin, 1964a), (Slater et at, 1966) problems. 
In this section it is shown that the convergence properties 
can be stated in the same terms as for the parameter 
selection problem. 

A two-point boundary-value problem is posed by 
assuming a given dynamical system described by 
n-dimensional vector differential equations 

*=/(*, t) (27) 
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with prescribed end conditions 


Wo), *ol = 0 \ 


(28) 


where t 0 and t f are initial and final times, x is an 
^-dimensional state vector, d> and ip are respectively 
/- and m-dimensional vectors, with / + m — n tf 2. It 
is assumed that a solution exists which cannot be 
determined in closed form, requiring the use of numerical 
techniques. 

We shall regard the solution to eqn. (27) known when 
the complete set of initial conditions x{t 0 ), r 0 is deter- 
mined such that eqns. (27) and (28) are satisfied. The 
explicit dependence upon eqn. (27) is conceptually 
eliminated by writing the solution as 


x(t) = x[x(t 0 ), t 0 , t ) (29) 


so that eqn. (28) becomes 


sw '.)’ ,o ’ ~ [«xwt),°L //], </}] - ° (3o) 


Equation (30) has the form of eqn. (1) where the 
parameters to be determined are x(t 0 ), t 0 , and t r . 

The offset vector method is implemented in a manner 
analogous to that described in Section 2. Approximate 
solvable relations 

g[*(*oUo,*/] = 0 (31) 

are derived, often by means of a simplified set of 
differential equations 

x = f(x, t), (32) 

subject to eqn. (28). For example, eqn. (27) may 
describe motion in a many-body gravitational field and 
eqn. (32) may represent an approximating two-body 
model with eqn. (28) specifying the initial and final 
positions at specified times. The solutions Xo(r 0o ), *o 0 » 
and t fo of eqn. (31) are entered as initial conditions into 
eqn. (27), and the differential equations are integrated 
from / 0o to t fo producing 

*o('/,) = 4*o('oo)> '/.]• (33) 


Substitution of t 0o , t fo and x 0 (r 0fl ) for t Q , t f , and x(*o) in 
eqn. (30) yields 

g[x 0 (t 0o ), t 0o , l,J = go ^ 0 

in general. Defining the vector 

Z T = [x(to) T , to, tf], 


the iterative computation of the sequence {z 0 , Zj, . . .} 
proceeds just as in Section 2 with the understanding that 
each evaluation of 


g[Xi{t 0i ), t 0i , t fi ] = gj 

requires integration of eqn. (27). 

The motivation for using offset vectors is now more 
apparent. Vis-a-vis higher order methods it may be of 


considerable computational advantage to obtain even an 
algebraically complex form of eqn. (31) if computation 
of the gradient of g[x(r 0 ), f 0 , t f ] is thereby avoided. A 
practical multidimensional example of this type is 
considered in the next section. Observe that the pro- 
jectile problem discussed in the Introduction can also be 
formulated as a two-poin,t boundary-value problem and 
its solution obtained in the manner described above. 

6. Example 2 

This section discusses an application of the offset 
vector method to a practical two-point boundary-value 
problem. Equation numbers denote those expressions 
in previous sections which are exemplified. 

Consider the motion of a body in a planar orbit in 
the earth’s gravitational field. If the earth’s rotation and 
atmospheric friction are neglected,* the equations of 
motion are reasonably accurately represented by 


X = V x 



X 

r JEA 2 5JEAV] 


v *=-?\ 

[ £+ r, - „ J 

! 

z = v z 



zj 

r JEA 2 5JEA 2 z*-\ 

2JEA 2 z 


[* + - 7 T- r 4 J 

~ r 5 


where A is the equatorial radius, J and E are constants, 
r = VC* 2 + z 2 ), an( * x and z are position coordinates in 
an orthogonal coordinate system with the z axis along 
the earth’s polar axis. Because the orbit is polar, only 
two dimensions need be considered. Equations (27) 
describe the gravitational accelerations including the 
effects of the earth’s slightly elliptical shape. Let us 
pose the problem of finding the initial velocity com- 
ponents, v x (t 0 ) and v r (/ 0 ), required to transfer a body 
from a given initial position at time t 0 = 0 to a given 
final position at a specified final time. Hence 
r 0 = 0 tf — T f = 0 ] 

x(t 0 ) — a x = 0 x(t f ) — £* = ()> (28) 

z(*o) — a z = 0 z(t f ) — b x = 0 J 

where a x , a z , b x , b z , and T f are given. 

For the case where the earth’s oblate effects are 
neglected (J — 0 in eqn. (27)), the task of finding the 
initial velocities subject to the given conditions is the 
familiar Lambert’s problem of classical mechanics. For 
this case eqn. (27) can be integrated analytically by 
changing the independent variable; several methods of 
obtaining explicit expressions for g(x) are known 
(Battin, 1964b). For J ^ 0, there is no known method 
of integrating eqn. (27) analytically; hence a numerical 
technique is required. 

The offset vector method is naturally adapted to this 
application by using the known solution to Lambert’s 

* It is recognised that neglect of the earth’s rotation contradicts 
the intent of treating a practical example. However, this effect can 
be included without changing the qualitative interpretation of the 
numerical results; it is omitted only to reduce the complexity of 
the discussion. 
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next iteration 

Fig. 2. Computational flow diagram of the ith iteration in 
Example 2 


problem with J = 0 as an approximation. Introducing 
J — 0 into eqn. (27) produces a set of equations repre- 
sented by eqn. (32) in Section 5. For the terminal 
conditions prescribed by eqn. (28), one form of g(x) due 
to Godal (Baffin, 1964b) is given by 


v x (0) 

v z (P) 

C, - 

c? — 


- C x {b x - C 1 a x ) = 0 

- C l (b z - C 2 a z ) = 0 

ViEP) _ Q 

r f r 0 sin 6 

1 + ^(1 — cos 0 ) = 0 


r o — 
r f~ 
0 

P 

B 

T t 


+ a]) = 0 
V(t> 2 x +b*) = 0 

- cos - ’ [{a x b x + a.b z )jr 0 r f ] — 0 

VirpTf) sin 2 0-50 
(B — cos a) cos 0 • 56 ~ 

- 0o + r f)l2 V(r 0 r f ) cos 0* 50 = l 

- 2j(V('* 0 '7) cos 0*50) l - 5 ^/ ( — 

[l _|_ — cos a)(2qc — sin 2oe ) ~|l _ Q 

[ 2 sin 3 a J J — ■ 


- cos a’ 
E 


l ) 


The solutions to eqns. (31) are the proper initial velocities 
to achieve the conditions in eqns. (28), neglecting the 
oblateness of the earth. Observe that eqns. (31) are 
transcendental in a; therefore their solution must be 
obtained numerically. This represents a situation where 
eqns. (2) cannot be inverted analytically. 

The offset vector method proceeds by carrying out the 
following steps : 


1. Denote the solutions of eqn. (31) as v*o(/o) an d 
t^oOo) » these are obtained by any convenient 
numerical method. Newton’s method has been 
used in this simulation. 


2. Integrate eqn. (27) from r = 0 to t = T f using 
a xi a z , v x0 (t 0 ) and v*o(/o) as initial conditions. 
Denote position on this trajectory by x 0 (t ) and z 0 {t). 

3. Evaluate the left hand sides of eqn. (28) for the 
integrated trajectory. Define 


bx^Tf) m x 0 (7» -b x 
AzoCr,) = z 0 (7» - b t . 

4. Recompute the initial velocities from eqn. (31) by 
requiring 

x(Tj)-b x = -A*„(7» 
z(T f ) - b, - -Az„(7». 

This implies that eqn. (31) undergoes the changes 
of variable, 

b x —^b x — A x 0 (Tf) b xo 
b,^b z - Az 0 (T f )^-b Z0 . 

Denote the solutions as w*i(/ 0 ) and v zl (t 0 ). 

5. Repeat steps 2 through 4 in an iterative fashion. 
The functional diagram in Fig. 2 illustrates the 
steps at the ith iteration. 

For this simulation the following parameter 
values are used: 


a x = 2-093 x 10 7 feet E = 1-407645 x 10 16 

flj = 0-0 feet /= 1-62345 x 10 3 

b x = 0-0 feet A = 2-093 x 10 7 feet 

b z ~ 3-0 x 10 7 feet T f = 2400-0 seconds 

This roughly represents insertion into a 2000-mile 
altitude orbit at a point above the pole from a point on 
the equator. The computation was performed in double 
precision arithmetic on an IBM 360/65 computer. 
Newton’s method is applied to solve Lambert’s problem 
and a Gill-modified Runge-Kutta integration technique 
is used to integrate eqn. (27) with a 20 second time step. 
The values of terminal position error, A x t {Tj) and 
Az^Ty), for two iterations are given in Table 1: 
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Table 1 


Position error data from simulation 

(Rounded off to 3 significant figures) 


ERROR 

QUANTITY 

i = 0 

ERROR WITH NO 

ERROR AFTER ith ITERATION 

CORRECTIONS TO 
LAMBERT SOLUTION 

I = I 

/ = 2 

|A*,(7»| 

FEET 

2-34 x 10 4 

FEET 

1*51 x 10 1 

FEET 

4-41 X 10” 2 

||A*«0»| 

5-37 x 10 4 

7*25 X 10 1 

1 

9-56 X 10' 2 



(a) *- space X( — ► 



(b) g- space Q, — ► 

Fig, 3. Progress of finite-difference modification of offset 
vector method in two dimensions 

Adequate accuracy is obtained in one iteration for many 
applications. For these cases any other numerical 
method that has an equal or greater convergence rate 
can be compared on the basis of the computational 
complexity of each iteration. 

In this simulation the time required to solve Lambert’s 
problem with sufficient accuracy is approximately 
0-01 seconds whereas that required for integrating eqn. 
(27) is 0-30 seconds. Because the latter* dominates 

* An integration step three or four times larger than 20 seconds 
would give terminal position accuracy better than 100 feet in this 
example. 


the former, any method that requires more differential 
equations to be integrated is at a competitive dis- 
advantage with the offset vector method. For Newton- 
Raphson type procedures, the gradient matrix of g( ) 
with respect to v xj (t 0 ) and t^(r 0 ) must be obtained. This 
can be obtained numerically by perturbing each velocity 
component separately and integrating eqn. (27) to 
determine the effect on the end conditions. Obtaining 
the complete gradient matrix by this procedure requires 
n additional complete integrations of eqn. (27) per 
iteration; this results in tripling the amount of integra- 
tion required in this example, effectively tripling the 
computation time for each iteration. The gradient 
matrix can also be obtained by integrating the linear 
variational equations associated with eqn. (27); how- 
ever, the increased computation is of the same order as 
that required to obtain the matrix by the perturbation 
technique. 

These comparisons indicate that the offset vector 
method is superior to higher order methods in some 
problems. The example considered here has application 
to rocket guidance for which the thrust is directed so 
that the vehicle’s velocity matches the values of v xi (t 0 ) 
and v zi (t 0 ) in Fig. 2. The two-point boundary-value 
problem must be solved many times in rapid succession 
because the initial time and the rocket’s position are 
constantly changing. For ‘real-time’ computation of 
this sort, speed is a primary consideration. 


7. Modified offset vector method 

In Section 3 it is pointed out that the offset vector 
method can serve as a starting procedure for higher-order 
techniques. The possibility for doing this is evident at 
the (n 4- l)th step after the sequences {x 0 , x u . . . x„} 
and (g 0 , £i, • • . g„} have been computed. Defining 


Ax,- « Xj — X;_ x ) 
Ag, = gi -gi-i) 


(34) 


we have sufficient information to derive an approximate 
gradient matrix (or its inverse) provided the Ax/s (or 
Ag,’s) are independent. For example, 


C = X' U S (35) 

where 0 and X are matrices whose ith columns are 
respectively Ag,- and Ax,-. Faster convergence may 
possibly be obtained by continuing the numerical pro- 
cedure with a Newton-Raphson-like technique using Xj 
to determine new values of x according to 

x f+l = ~Grig f (36) 

where G, depends upon the last n values of Ag and Ax. 
In this section we shall describe a recursive method 
whereby the gradient information available at each stage 
is utilised to adjust the offset vector computation, 
producing results analogous to eqn. (36). 
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Consider the first two steps in the offset vector method 
after which x 0 , x u g 0 and g, are known. These ‘points’ 
are indicated for a two-dimensional case in Figs. 3a and 
3b. With Ax, and A g, thereby determined, we can 
calculate the required first order change Ax,' in x to 
produce a desired change Ag' in g in the direction of 
Ag,- 


A * ; = m 


(37) 


Note that Ag' is a scalar that may be either positive or 
negative. Out objective being to drive g to zero, to first 
order (approximately*), we can remove that component 
in the direction parallel to A g, by defining 


Agi 

Agi = (ffl *Afi — 

Ax,' = -(gi./A fl )Ax,/|Agil 


x[ = *i + Ax| 

— £i + Agi* 


( 38 ) 


These quantities are illustrated in Fig. 3. Note that 
—(gj.iA gl ) in eqn. (38) plays the role of Ag' in eqn. (37). 

There is as yet no gradient information available in 
the direction normal to Ag, so, at this point, return to 
the offset vector algorithm. First, using eqn. (13) 
calculate the value yo of th e offset vector that cor- 
responds to x [ : 

Vo = g(x [)■ (39) 

Assume that 

g(x[) s g, + A g[ a g { ; (40) 

note that exact equality does not hold because Ag{ is 
computed from a linearised analysis. Now let 

y i = Vo g\ 1 

= Kv i) r ( 41 ) 

%2 = g( x i)- J 


This completes a new step in the iteration process. 
Observe that the same number of evaluations of g(x) 
are required as for the offset vector method. The 
difference is that x 2 is computed with the aid of an 
intermediate value x[ that is calculated by a finite dif- 
ference projection. 

From x 2 and g 2 the quantities 

Ax 2 = x 2 — x, ; Ag 2 = g 2 — gi (42) 

are calculated as illustrated in Fig. 3. In the two- 
dimensional case Ax b Ax 2 , Ag, and Ag 2 provide suffi- 
cient information to continue the search for x* by a 
finite difference method alone, provided the Ax’s and 
Ag’s are independent. In higher dimensions we can 
proceed as before, calculating an intermediate x 2 based 
upon finite difference projections in both Ag, and Ag 2 

* This is not an exact first prder calculation because the gradient 
in the direction A#, is computed from a finite difference. 



(o) x — space 



Fig. 4. Illustration of ortbogonalisation of the vectors Ag, 
with the associated transformation on the Ax, 

directions and using the offset vector to find corrections 
to x 2 in the remaining directions. Here we shall derive 
a recursion based upon orthogonalisation of the vectors 
Agi. 

Suppose Ag,, Ag 2 , Ax, and Ax 2 are given as shown 
in Fig. 4. The component of Ag 2 orthogonal to Ag, is 
given by 

Sg 2 = Ag 2 - (Ag 2 .t A J/A*r 

According to eqn. (37), the associated change in ^ 
required to accomplish the increment Sg 2 is given by 

Sx 2 = Ax 2 - (Agj.i^/IAg^Ax,. 

Defining Sx, = Ax! and Sg, = Ag,, we can calculate 
the change Ag 2 required to drive the projection of an 
rt-dimensional vector g 2 on the space of the orthogonal 
vectors Sg, and Sg 2 to zero. Requiring 

(Ag 2 + gi)^gi — 0; f—1,2. 
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we have . 


Ag 2 — (&2 ■ Mf |)* J|i (#2 ■ 

The associated change in x, Ax 2 , is given by 


Ax 2 - -(^2-*« f i/l S ^ll) S ^l - (*2- Wl^llW 
Having A g 2 and Ax* we can calculate x' u g 2 , yj, y 2 , *3» 
and g 3 from eqns. (38), (39) and (41) by increasing the 
value of each subscript by one. 

This reasoning leads to the following set of recursion 
relationships for deriving x i+ h having {x 0 , x„ . . . *,}, 
{g 0 > *i, • • • gi }, orthogonal directions {Sg u 8g 2 , . . . Sg 4 _ 
and the corresponding set of ‘influence’ directions 
{$*,, 8*2, . . . 8*,-!}: 

gi = g( x i) ) 

Axi = x,- x^i 


Ag, = gi ~ gi-\ 

i-i 

hi = Ag, - 2 (A *,./«„)/#„ 

j - 1 

8*, = A*, — 2 

y'“ 1 
I 

A gi ~ 2 (gi'i&gjlkgj 

j = i 

a*; = - s 

— */ + a*; 

£/ = gi + Agl 


(43) 


Vi - i = *0/) 

Vi = yJ -i - gi 
*/+ 1 = %/)• 


To start the process, two iterations of the unmodified 
offset vector method are performed to provide values of 
x 0 , x i > go an( * g i- For i ;> n, we can discard all 8xj and 
hj for j ^ / — n; one set of directions is then effectively 
removed at each step to be replaced by 8g t and 8x f . 
Furthermore, for i ^ n the last four expressions of 
eqn. (43) can be disregarded if we let 

x H , = x ■; i ^ rt. (44) 

That is, a Newton-Raphson-like procedure, using 
approximate derivatives can be substituted for the offset 
vector method at the nth step. 


8. Summary and conclusions 

The offset vector method presented here is one that has 
been utilised to solve mathematical problems arising 
from special applications. The technique has evolved 
in this fashion because it requires knowledge of an 
approximate solution whose availability is dependent 
upon the physical situation. The purpose of this paper 
is to give the method more formal status as a numerical 
technique by presenting a recipe for its implementation, 
by developing criteria for convergence, and by illustrating 
its advantages through examples. It is found that the 
convergence rate is generally slower than that of second 
and higher order methods, but each iteration is relatively 
rapid to perform. Possible applications are those where 
few iterations are required or as a starting procedure for 
higher order methods when many iterations are 
necessary. 
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Abstract 

A linearized theory is developed for minimum fuel guidance in the 
neighborhood of a minimum-fuel space trajectory. The thrust magnitude is 
unrestricted so that the thrust is applied impulsively on both the nominal trajec- 
tory and the neighboring optimal trajectories. The analysis allows for additional 
small midcourse impulses as well as for small changes in the magnitude, direc- 
tion, and timing of the nominal impulses. The fuel is minimized by determining 
the trajectory which requires the minimum total velocity change when summed 
over all the impulses. 

The analysis is deterministic and applies to arbitrary time-varying 
gravitational fields. Three separate time-open problems are treated; rendezvous, 
orbit transfer, and orbit transfer with tangential nominal impulses. 

* th 
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List of Symbols 


6R Position deviation 

t^ Nominal final time 

6t Time deviation 

Component of midcourse impulse in the critical plane 
u nc Component of midcourse impulse in the noncritical direction 

V T Velocity of target trajectory 

Vj Velocity of nominal interception trajectory 

AV Nominal terminal impulsive velocity change 

£6 V Total change in impulsive velocity cost 
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Introduction 

This is the second of a series of papers on minimum fuel guidance 
of high-thrust rockets. The first paper (Ref. 1) illustrated the general approach 
by treating the particular problem of guidance from a hyperbolic to a circular 
orbit. The succeeding papers are intended to generalize this approach to more 
general classes of guidance problems. This generalization will be carried out 
in several stages. The present paper will consider the general case of time- 
open impulsive guidance. Later papers will extend the analysis to finite thrust. 

There is a well -developed theory for minimum fuel impulsive guidance, 
e.g. , Refs. 2, 3 and 4. However, these references consider only the case of 
an unpowered nominal trajectory. The nominal trajectory around which the 
analysis is linearized is a coasting arc. The present paper is intended to 
generalize these results to nominal trajectories containing one or more finite 
impulses. The analysis will consider three different problems. The first 
problem to be treated will be minimum fuel guidance for time-open rendezvous. 
The second problem will be time-open orbit transfer, and the third problem 
will be an important special case of the second, where one or more of the 
finite impulses is tangent to the velocity vector. 
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Mathematical Model 

The analysis of the present paper is linearized about a nominal 
trajectory containing one, or more, finite impulsive velocity changes. This 
nominal trajectory must be an optimal trajectory minimizing the sum of the 
absolute magnitude of its impulses for transfer between its terminal states. 
The problem considered is the deterministic problem of determining the 
minimum impulse transfer from a given state in a close neighborhood of the 
nominal state at a given initial time to the terminal state with time open. The 
nominal trajectory may lie in a general time-varying gravitational field. The 
analysis is a first order analysis neglecting second order terms. It is arialo- 
gous to the neighboring optimal guidance schemes developed for smooth 
optimization problems without corners. The problem is complicated by the 
possession of corners and the possibility of introducing additional impulses. 
However, the problem is simplified because it is a first order analysis. In 
general, the problem will be to guide the vehicle from a given initial state at 
a given initial time to a final time in the near vicinity of the nominal terminal 
time. For the orbit transfer problem the final time may be allowed to become 
arbitrarily large; and it may also be possible to extend the initial time arbi- 
trarily far backwards in time. 
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Analysis 

I, Time-Open Rendezvous 

The key concept in analyzing minimum-impulse guidance for time- 
open rendezvous is the concept of a noncritical direction. This concept was 
originally developed for use in interception problems rather than rendezvous 
(Refs. 2 and 5) but is also useful in analyzing rendezvous. Consider the case 
where the nominal trajectory has a single finite impulse which accomplishes 
rendezvous at a nominal terminal time. If rendezvous were to be accomplished 
at a slightly earlier time fit, then the point at which rendezvous is accomplished 
must be displaced by the negative product of the target velocity vector and the 
time change. 

6R = - V T 6t @ t = t f - fit (1) 

This position is reached by the intercepter at an earlier time than the nominal 
arrival time. If the trajectory of the intercepter were continued to the nominal 
arrival time, it would have the position given by Eq. (2) and shown on Fig. 1. 

6Rj = - V T 6t + ^61 = - AT fit @t=t { (2) 

This indicates that, if the intercepter will intercept a specified line in space 
at the nominal arrival time, then it will (to first order) also intercept the target 
at a somewhat earlier or later time. This specified line passes through the 
nominal arrival point and has the direction of the nominal finite impulse. This 
direction through the nominal arrival point is known as the noncritical direction 
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at the nominal arrival time. It represents the one permissible direction of 
position variation which will still lead to rendezvous. This noncritical direc- 
tion may also be propagated backward in time by use of the state transition 
matrix. It will then define a noncritical direction at any point along the nominal 
trajectory. 

In order to effect rendezvous, it is necessary to control the two 
components of position variation in the plane normal to the noncritical direc- 
tion. This plane is known as the critical plane. Once the terminal position 
of the target vehicle and the rendezvous vehicle has been matched by reducing 
the position deviations in the critical plane to zero, rendezvous is accomplished 
by a finite impulse which nulls the difference between the target and inter- 
cepter velocities. To first order, only one component of terminal impulse 
variation adds linearly to the cost; that in the direction of the nominal impulse. 
Any small deviations in the velocity vector normal to this direction may be 
cancelled by small rotations of the nominal terminal impulse. Such rotations 
only increase cost to second order and may be neglected in a first order 
analysis. 

The foregoing considerations indicate that only two components of 
position and one component of velocity at the nominal final time must be con- 
trolled for time-open rendezvous. This reduces the original 6-dimensional 
parameter space to a 3 -dimensional parameter space. If there is only one 
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finite impulse, then the analysis for unpowered nominals in Refs. 2, 3 and 4 
may be applied without change to this 3-dimensional parameter space. That 
analysis indicates that the optimum solution has no more than three impulses. 
One of these impulses will represent a variation in the magnitude of the nominal 
impulse so that there are, at most, two midcourse impulses. 

The required position correction at the nominal terminal time may 
be accomplished with a single midcourse impulse. If this corrective impulse 
occurs at a specified time, then the optimum direction of this impulse may 
easily be calculated. One component of the impulse will produce the position 
correction. This component will lie in the critical plane. There will also be 
a component of the midcourse impulse in the noncritical direction. This com- 
ponent will be used to reduce the magnitude of the large terminal impulse and 
will result in an overall saving in impulse magnitude and fuel. The total change 
in impulsive velocity is given by Eq. (3). 


(3) 


(4) 


j 
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n 2 b | A v| a|Av| 

S6V = J u + u - — L u - L u 

c nc du nc d u c 
nc c 


The optimum magnitude of the velocity component in the noncritical direction 
may be found by differentiating Eq. (3), and solving for the stationary minimum 
point given by Eq. (4). 

Mli|u | 

du c 
* nc 




^AV| 

L C nc 


i 
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The total cost of the optimum correction at a specified time is given by Eq. (5). 



mi 

du 


( 5 ) 


In the particular case treated in Ref. 1, the midcourse correction 
should be made as early as possible and there will be only one midcourse 
impulse for the minimum fuel solution. This behavior will be typical of most 
cases as the time approaches the terminal time. However, in other cases as 
many as two midcourse impulses will be required to minimize the fuel con- 
sumption. It is also possible that a single impulse at a time later than the 
time under consideration may be optimum. There are both direct and indirect 
approaches to this optimization problem. The indirect method calculates the 
primer vector (Refs. 6 and 7) from the direction given by the optimum direction 
of a single midcourse impulse at the current time to the terminal impulse at 
the terminal time. If this vector is less than unity at all intermediate points, 
then the single correction will be the absolute minimum fuel solution. 

The direct method is a constructive approach utilizing the convex 
hull of the reachable set of terminal states (Ref. 2). This reachable set is 
constructed in a parameter space defined by the change in the terminal impulse 
magnitude and by the two position components in the terminal critical plane. 
Each of these parameters is normalized by the magnitude of the midcourse 
velocity change. An optimum maneuver must lie on the convex hull of the 
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reachable sets in this space. The set of all impulse directions at a given time 
will define an ellipsoid in the parameter space. Equations (4) and (5) will define 
a generator of a cone which is tangent to the ellipsoid and whose apex is at minus 
one on the velocity axis (see Fig. 2). If a single correction at the earliest pos- 
sible time is optimal, then the cones for all subsequent times will lie inside the 
initial cone. If two midcourse corrections are required, then the convex hull of 
all the cones will have a plane as one of its bounding surfaces. If a single cor- 
rection at a later time is optimal, then one of the later cones will project through 
the cone corresponding to the initial time. The geometric construction for these 
cases may be reduced to a 2 -dimensional construction by using the traces of the 
cone on the plane of the position variations. In exceptional cases where such 
traces do not produce closed figures, it may be necessary to use another plane 
that passes through the cones. 

If the nominal trajectory contains one or more large impulses before 
the final impulse, then all necessary corrections may be made by utilizing small 
variations in these impulses. It is only necessary to consider small variations 
of timing and direction of these impulses. Such variations allow control of one 
component of position and two components of velocity at the time of the impulse. 
These three components may then be propagated to the terminal state by means 
of the state transition matrix. Except in exceptional cases it will be possible to 
control all three required components of the terminal state by this means. This 
control will (to first order) produce no increase in cost. This is shown by the 
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fact that the primer vector passing through the two impulses of the optimal 
nominal trajectory is stationary with respect to small variations in impulse 
timing and direction. 

II. Time-Open Orbit Transfer 

If the object of the mission is orbit transfer rather than rendezvous, 
the particular phasing of the vehicle in the final orbit is unspecified. This means 
that there will be a set of noncritical directions arising from all points on the 
target orbit in the vicinity of the nominal terminal time. This set of directions 
will to first order define a plane in which will lie the velocity vectors of both 
the target orbit and the transfer orbit at the nominal terminal time. All trajec- 
tories which are close neighbors of the nominal trajectory and which touch this 
noncritical plane at the nominal terminal time will also intersect the target 
trajectory at a time close to the nominal terminal time. For the orbit transfer 
problem it is only necessary to control the one component of terminal position 
in the critical direction which is normal to the noncritical plane. The parameter 
space which must be considered is only 2 -dimensional, containing one position 
component and one velocity component. There will be at most one midcourse 
impulse in addition to small variations in the terminal impulse. The optimum 
midcourse impulse may occur at a time other than the earliest possible time. 

In fact, in some cases this single midcourse impulse should occur in the 
neighborhood of the terminal orbit rather than in the neighborhood of the transfer 
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orbit and at a time later than the time of the nominal terminal impulse. The 
latter case is easily analyzed by considering the set of reachable states in the 
vicinity of the terminal orbit, as well as in the vicinity of the transfer orbit. 

HI. Time-Open Orbit Transfer with Tangential Impulses 

In many orbit transfer problems, such as the well-known Hohmann 
transfer, the impulses are applied tangent to the velocity vector. In such a case 
the noncritical plane of the preceding section becomes undefined and it is once 
again necessary to consider a 3-dimensional parameter space possessing two 
components of position variation. This case is similar to the case of time-open 
rendezvous and possesses a noncritical direction and a critical plane. As in the 
preceding section, it may be desirable to consider midcourse impulses in the 
terminal orbit as well as in the transfer orbit. It is possible to have a midcourse 
impulse before the major transfer impulse in the neighborhood of the transfer 
orbit, as well as a post- terminal- time midcourse impulse in the neighborhood 
of the nominal terminal orbit. If there are one or more large impulses on the 
nominal trajectory before the terminal impulse, then variations in the timing 
and direction of these impulses may be used to control the trajectory. In the 
particular case of a Hohmann transfer, these variations will not be sufficient 
to control all out-of -plane deviations because the two impulses are located at 
singularities of the state transition matrix. In this case it will be necessary to 
utilize midcourse impulses in either the transfer orbit or one of the terminal 
orbits for controlling the out-of-plane component of the terminal position variation. 
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Conclusions 

(1) Minimum impulse time-open rendezvous in the neighborhood of 
an optimal nominal trajectory requires at most two small midcourse impulses 
if the nominal trajectory possesses one large finite impulse. Two midcourse 
impulses may be required if either the nominal trajectory or the deviations 
from it are nonplanar. If both the trajectory and deviations are planar, not 
more than one midcourse impulse will be required to realize minimum total 
impulse, 

(2) Minimum fuel, time-open orbit transfer in the near vicinity of an 
optimum nominal requires at most one small midcourse impulse if the nominal 
trajectory contains at least one finite impulse which is not tangent to the velocity 
vector. If both the nominal trajectory and the small deviations from it lie in the 
same plane, there will be no small midcourse impulse. In the latter case, the 
first order minimum fuel solution will be a single impulse at the intersection 

of the two orbits. 

(3) For both time -open rendezvous and orbit transfer with two or 
more finite impulses, no midcourse impulse will be required unless the finite 
impulses occur at singularities of the state transition matrix. 
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Actual Transfer Orbit 



Fig. 1 Intercept Geometry 
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ABSTRACT 

Objections to applying the spheroidal method to calculate a 
polar orbit of an artificial satellite are easily overcome. 

Previous papers have already treated the behavior in an exactly 
polar orbit of the right ascension gf, the coordinate for which 
the difficulty supposedly occurs. Just as in the Keplerian prob- 
lem, it remains constant, except for jumps of 180 ° at a pole. 

There remains the case of an almost polar orbit, for which 
the calculation of ^ may be inaccurate near a pole, unless one 
takes special precautions. The present paper first simplifies 
the expression for ^ for all orbits, polar or not, and then shows 
how to avoid the difficulty altogether, by solving directly for 
rectangular coordinates and velocities. These considerations 
apply both to papers by the author and by Izsak on the original 
spheroidal method and to the author’s later papers incorporating 
the third zonal harmonic into the spheroidal potential. 

The present paper simplifies orbital calculations by the 
spheroidal method for satellite orbits with all inclinations. Its 
main points are the bypassing of the right ascension and the 
avoidance of differences of almost equal quantities, so that all 
calculations become well-conditioned. 
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1 . INTRODUCTION 


Objections have sometimes been made to applying the author* s 
spheroidal method to calculate a polar orbit of an artificial 
satellite. The coordinates that appear are o , for which the level 
surfaces are oblate spheroids, n, for which they are hyperboloids 
of one sheet, and the right ascension rf. The apparent difficulty 
in a polar orbit arises only in and then only at a pole. 

For an exactly polar orbit I have already shown by limiting 
processes in V1961a and V1961b (1) that the spheroidal potential 
leads to = constant, except at a pole, where it jumps by + 180° , 
accordingly as we call the orbit direct or retrograde, respectively. 
This is the expected behavior, just the same as for a Keplerian 
orbit, so that no real difficulty appears. it holds whether or 
not the model takes into account the third zonal harmonic, with 
coefficient . 

Although the difficulty was easily disposed of, without 
tedious numerical calculations, for an exactly polar orbit, one 
might still claim that it remains troublesome for an almost 
polar orbit. For such an orbit the calculation of pf involves 
a small denominator which almost vanishes near a pole. One then 
may very likely lose accuracy in passing by the pole or have to 
use special procedures which will increase computer time and 
storage demands and which will not elsewhere be necessary. The 
present paper shows how to avoid such difficulties. 

2. THE AUTHOR'S SPHEROIDAL SOLUTION,' WITHOUT J 3 

The notation in this section is that of Vl961a, corrections 
of which are to be found in Walden and Watson 1967, p. 16. The 


rectangular coordinates X, Y, 2 satisfy 


2 o 2 

X + iY = (P +c ) (1 -tT) exp i 

(1.1) 

Z = PT] 

(1.2) 

Now by (8.50) of Vl961a, 


^ a O' + F , 

(2) 


1. The initial V refers to the author's own papers. 
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where F is that part of the expression which varies rapidly near 
a pole. Here O' is given by Eq . (9) of the present paper and 

F - KX (3.1) 


where 


But 


K = | K t sgn a 3 , 


K 2 = a 3 2 n 0 2 ri 2 2 (a 2 2 -a3 2 )' :l (T, 0 2 + T, 2 2 -l-T lo 2 T l2 2 ) 


-X >. 


(3.2) 

(4) 


T) 0 2+T1 2 2 = l+a 2 V 2a l c2 > 


“I 


2 2 2 2 0 

VV * <a 2 -ap<-2aiC 2 ) 


(4.1 of V 1961a) 
(4.2 of V 1961a) 


It follows that K * 1, so that 


K - sgn * +1 ( 5 ) 

for direct or retrograde orbits, respectively, in order that the 
right ascension $ may correspondingly either always increase or al- 
ways decrease. Then 

= ft' + X sgn a 3 (6) 

is an exact equation for all orbits, with the spheroidal model. 

This is in contradistinction to the results of VI 961a, where it 
was only shown to hold for polar orbits. Thus the present work 
simplifies all calculations with the spheroidal model. 

To find the rectangular coordinates X and Y directly, without 
first calculating insert (6) into (1.1), use 

__ 1 

exp iy * (1 -t iq^ sin 2 i)r) (cos if + iJl-r\ Q 2 sin y) (7) 


from the last paragraph of Vl961b, and then put p^r^sin ijr and 

(l-n^ 2 ) 1 / 2 =| cos l|, from (6.4) and (4.7) of Vl961a. The 

2 1/2 

troublesome denominator ( 1 -T| ) ' then cancels out, with the 
result 
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2 2 

X+iY = (P +c ) (cos 4 +i cos I sin \Ji)exp iP* 


( 8 ) 


for all orbits, direct or retrograde. Here 

1 

2 

* 3 3 +a 3 (a 2 2 -a 3 2 ) n 0 (B 3 *+ sin 


-c 2 a 3 (-20^) “ 1 ^ 2 (A 3 v+ T A^sinkv) 


( 9 ) 


Separately 


2 2 

X*(p +c ) (cos O' cos \li-sin r> ' cos I sin '>*) 


( 10 . 1 ) 


Y*(P 2 +c 2 ) (sin O' cos + cos 0* cos I sin ♦ ) (10.2) 


These expression contain no singularities or rapidly varying 
quantities, so that there is thus never any difficulty with a polar 
or almost polar orbit. For a strictly polar orbit cos I and a 3 
both vanish, so that P'=0 3 and 

(ID 


2 2 1/2 

( o +c ) cos t exp i3 3 


X+iY 

3. Izsak's Spheroidal Solution 


Although Izsak (1960, 1963) suggested using a slowly rotating 
reference plane to avoid the polar difficulty, actually the same 
transformations hold for his solution of the spheroidal problem. 
For the sake of accessibility, I shall refer to his 1963 paper, 
in making the comparison, note that my symbols are to be changed 
as follows: -+ a, r\ o , r] Q -> s , and 0 3 -> 0*,' others remain the 

same. Then, with use of Izsak's Eqs. (3), (91), (37), and (63), 

one finds again the equivalent of the present Eqs. (10) for the 
rectangular coordinates X and Y. Note that Izsak's expression for 
O' contains (l-s 2 ) 1//2 in the numerator and 1-e in the denominator 
of each term except 0 *. The 1-e 2 in such a denominator does not 
necessarily produce a singularity as e 1 , since each (1-e ) 
is multiplied by v-c/a and p - a (1-e 2 ) is a quantity analogous 
to the semi-latus rectum in a Keplerian orbit, in such an orbit 
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p > 0 for any orbit that does not intersect the center of the 
planet, even if e=l , Incidentally, the same powers of p occur in the 
coefficients . A^ and the ' s of V1961a. 

4, Isolation of the Right Ascension 

In either solution, the quantity here called x is the sensi- 
tive part of the expression for the right ascension If one 
actually wants values of near a pole in an almost polar orbit, 
it is better to rewrite Eq . (7) as 

_ 1 

2 2 2 ^ 

exp i\ = (cos ifrcos I sin if) (cosif -fi|cos 1 1 sin if) 

( 12 ) 

One thus avoids calculating the difference of two almost equal 
numbers in the denominator. Then ^ is given by (6) and (12), 

5. Velocity Components, with J^=0 


On taking the logarithmic derivative of (8) and multiplying 
the result by X+iY, we find 


so that 


X+iY = | P ~ 2^ ~ 2 + (X+iY)+ (0 1 2 +c 2 ) (-sint +i cos I cos 

\ p + c J 

V f (13) 


X m — X-YO’+ (p 2 +c 2 ) (-sin t|t cos O' -cos I cos if sin Cl*)if 
p +c 


(14.1) 


1 

. 2 

Y = — ^2 Y+x^ ’ + (p 2 +c 2 ) (-sin sin O' + cos I cos if cos fl')| (14.2) 

p +c 

Differentiation of (1.2) gives 


Z = ‘np+pr) = np+hgP cos If if (15) 

These equations contain neither small denominators nor differences 
of almost equal quantities. Here 
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•fe) 


-1 


(p 2 +Ap+B) (p 2 +c 2 , n 2 ) sin E 


from p. 6 of Bonavito 1962, and 

1 1 

2 2 

*n = ^qCos't ( a 2 2 “ a 3 2 ) (l-q 2 sin ¥) (p +c 2 r) 2 ) ^cos 

from p. 15 of Walden 1967 r after a few transformations. Here 

q= T l 0 / T ^2 * Then 

1 1 

* -1 2 2 2 2 2 2 2 2 2 
♦ = rig (a 2 -a 3 ) (P 2 +c h ) (1-q sin \|i ) 

Finally, by Eq. (9) of the present paper. 


a 3 ^ a 2 2 ” a 3 2 } W 16 V^« s 24,)i 


2-4 


1 

2 “ 2 4 
-c a 3 (-2a 1 ) (A 3 + I kA^^cos kv)v 


Thus we also need v. With 


o = (1+e cos v) ^p 


from (5.12) of Vl961a, where p=a(l-e ), we find 
* e 2 

p = — p sin v v 
P 

Comparison of (16) and (21), with use of the anomaly connection 


sin E = (1-e ) sin v 


then gives 


• a 

V " P 


U(l-e 2 ) 


2 


(p +AP+B) 
p 2 +c 2 r| 2 


Eqs. (14) , (15 ) , (16 ) , (18) , (19) , and (23) then give the complete 


(16) 


(17) 


(18) 


(19) 


( 20 ) 


(21) 


( 22 ) 


(23) 
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algorithm for finding the velocity components in the spheroidal 
model, when is not included. 

6 . The Author * s Spheroidal Solution, with 

The notation in this section is that of V1966, corrections of 
which are to be found in Walden and Watson 1967, pp. 19, 20, 22, 

27, and 31. With this solution 

& - O’+G sgn ql^ , (24) 

where o' is given in Eq . (41.4) of the present paper and where 

G is given by Eq , (150) of V1966, viz 

1 _ I 

G= la 3 |a“ 1 u 2 (l-S) 2 [(h 1+ h 2 ) x 0 +(h 1 -h 2 )x 1 J (25) 


From Eq.(158) of VI 966, we have ^ ^ 

— 1 ^ 2 7^ 

(h 1 +h 2 )x Q +(h 1 -h 2 )x 1 =2 (1-C 2 ) [ (1-C 2 ) -C 1 ] (E 2 '+E 3 *) (26) 

If u is a solution of the cubic equation (27) of V1966, then by 
(32.1) and (32.2) of that paper 


2 

C, - ~~~ (16), C 1 =2u6 Pn " i (l-C 0 S)’ i (l-C 0 ) 1 


a nP 


0^0 


so that 


(27) 


(i-c 2 ) 2 -c 2 


l-c. 


where 


1_ __u_ (k _ai_\ 

1-8 I \ u a oPo ) 


(l-S)-R 


(28.1) 


R 5 l u ' a n p, 


0^0 




a nP, 


0 F 0 


By (27) of V1966, however, 
_2 


R ~ u -1 


a nPi 


0^0 


(1-S) 


(28.2) 


(29) 


Insertion of (29) into (28) then shows that 
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(1-C 2 ) x [ (1 -c 2 ) 2 -c 1 2 ] = (l-s) X (u-S) , 

which, with (26), gives ^ ^ 

“ 2 2 

( h l 4 ' h 2 )x 0 +(h l' h 2 )v l = 2 (u ' S) (1-S) (E 2 ,+E 3 ,) 

Now, by Eqs . (21.2), (18), and (26) of V1966, for all orbits, 

direct or retrograde, 

I i 

| a 3 I a 2 u * (u-S) 

Then, from (25), (31), and (32), 


(30) 


(31) 


(32) 


G = J (E 2 , +E 3 t ) (33) 

for all orbits, polar or not, and direct or retrograde. This 
is the same as the expression given in Eqs. (159) of V1966 for the 
sensitive part of & in the case of a polar orbit. Here, however, 
we have shown that it holds for all orbits. 

To evaluate G, place E 2 *= E 2 * (1+^/2) and e 3 1 =E 3 * ('ll- j) into 
Eqs. (104) of V1966. The results are 


e 2 ~sin Hi 

cos E 2 ' = l-e' 2 "sin * 

1 

2 2 

(l-e 2 ) cos * 
sin B 2 * * l-e 2 sin “Hi 


cos E^ ' 


e^-fsin H> 

1 +e 3 sin Hi 


sin E 3 ' 


1 

2 ^ 

(1-e ) cos * 

l+e 3 si'n "5 


(34) 


where 

e 2 =(l-P) _1 Q, e 3 =(l+P) _1 Q , Q 2 «P 2 +S , (35) 

with 0 < e 3 < e 2 < 1, by Eqs. (100) and (47) of V1966. Then, by 
(33) , 

cos(E 2 '+E 3 i ) = cos 2G = 2 cos 2 G-1 (36) 

From (34) and (36) it then follows that 
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_ 1 / \ 

2 2 (l+e e + J{ 1-e 2 ) (1-e 2 ) ) cos f 
cos G = Mf) ' — -J — . (37) 

[ (l-e 2 sin if ) (l+e^sin ^ ) ] 2 

where k(|)=+l. 

We now show that k(i|i)=l for all First note that E 2 *(y) 

is related to y in the same way that an eccentric anomaly is re- 
lated to a true anomaly. The same holds for E 3 '(y), Thus each 
increases as y increases, by Eq . (160) of V1966, so that G=2 x 

[ E 2 ' {y+ v /2 ) +E^ ' (^-tt/ 2) ] is a continuous monotonically increasing 
function of ^ . 

Also, from the definitions, E 2 '(y) and E^'fy) are both equal 
to nTT for y = nn. Thus 

G = ^ for ij( = (n+ ~)tt, (n«0 , 1 , 2 . . . ) (38) 

so that cos t and cos G both vanish for ^ = (n+ ^)tt. Now consider 
a small interval (n+ 2 ) n-e 1 ^ = (n+ 2 > n + £ • Since G always 
increases with increase in the corresponding changes A cos t|i 
and A cos G are both negative if n is even and both positive if n 
is odd. Thus k(ijf) >0 over any such inteval. But k(^) = ±1 for 
all f and since cos G and thus k ( >1/ ) are continuous functions of i , 
it follows that 

k(^) = 1 for all | (39) 

Before we rewrite (37) with omission of k(*ji), let us first 
simplify it. To do so, note that by (35) and by (48) of V1966, 
whi ch i s 

■q = p + Q sin ^ , (40) 

we obtain 

(l-e 2 sin y ) (l+e^sin f ) = (1-P 2 ) 1 (l-‘n 2 ) (41) 

Now from Eq . (32.3) of V1966 

2 P = r (1-S) 6, (42.1) 
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where 



r = 2(1-C 2 S) 1 u/p o 

Thus 6 = O [J 2 ) and r = 0 ( 1 ). Eqs. (35) and (42) then show that 


(42.2) 

(42.3) 


l+e 2 e 3 + ,/(l-e 2 2 ) (l-e 3 2 ) = (1-P 2 ) (1+S+ (l-S) % /l-r 2 6 2 ) (43) 

On inserting (39), (41), and (43) into (37), we find 


cos G = 2 2 (1-T| 2 ) [ l+S+(l-S) x /l-r 2 6 2 ] cos * (44) 

We also need sin G in calculating rectangular coordinates. 

To evaluate it unambiguously first note that 


2 sin G cos G = sin (E 2 , +E^ 1 ) 

= (l-n 2 ) _1 (l-P 2 )[ (e 3 ,/l-e 2 2 -e^/l-e^H {^l-e' 2 ' 2 +^l-e 3 2 
from (35), (42), (44), (45), ana (46) 


{ Q (/l-r5 ~/l+r6)+[ (l+P)/l-r6 + (1-P )/l+r ft] sin* } 

[ 1+S+ (l-S)'vi-r 2 6 2 ] 2 (47) 

1 /2 

To check this, note that for J- = 0 we have 6=0, P=0, Q=S , 
2 J 

and S = sin I, so that (47) then reduces to 


by (24) and (31) 
follows that 


1 


sin G = 


( 1-S ) 


,, 2 . 2 
(1 -Tl ) 


(45) 

) sinty ] co s\li 

(46) 

it 


_ 1 

sin G = (1 -ti 2 ) 2 | cos l|sin if, (48) 

agreeing with (7) for sin x. 

If one really wants values of the right ascension near a pole, 
one can use (24), (44), and (47). 
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It is then advisable, however, to rewrite the l-r\ 2 in the denomi- 
nator by using (35) and (40) . One finds 


l-rj 2 = cos 2 |- (P 2 +2PQ sin if ) + (1 -S-P 2 ) sin 2 | , 


(49) 


resulting in the same kind of simplification near a pole as does 

(12) . 

Near a pole in a nearly polar orbit the term -(p 2 + 2PQ sin^) 
in (49) is much smaller than the positive term ( 1-S-P 2 ) sin 2 ^ . To 
verify this statement, note that in a nearly polar orbit, S^l, Q^l , 
P«l, and near a pole |sin 1. Then from (32.3) of V1966 



P=(1 - SqPq Su > 1 p‘ U < 1 - S ) - 6400 

(49.1) 

so that 

. 2 

|P + 2PQ sin 

~ 3200 

(49.2) 

and 

(l-S-P 2 )sin 2 ^ 

% 1-S 

(49.3) 

Thus Eq . 

(49) gi”es no trouble near a pole. 



In rectangular coordinates we find from (1 ) , (24) , (44) . and 

(47) 

1 


2 2 —1 - 

X *= (p +c ) [H,cos O' cos if -H.. /1-S sgna^sinQ’ (H^+H-sint) ] 

(50.1) 

1 

2 2 2 -1 — 

Y = (p +c ) [H,sin O' cos if +H, /1-S sgna^cosO' (H-+H. sin^ ) ] 

J 2 J (50.2) 

and 

Z = p r\-b , (50.3) 

from (1.2) of V1966. Here 

_ 1 f 1 


H i 

= 2 2 [1 + S + (1-S) Jl-r 2 b 2 ] 2 

(51.1) 

H 2 

= ^ Q ( /1-r 6 - /l-r6 ) 

(51.2) 

H 3 

= |[ (l+P) /l-r 6 + (1-P)/1 + r6 ] 

(51.3) 
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and 


20 ^) 


(A 3 v 


4 

+ 7 A 
k=l 


3k 


sin kv) 


1 

+a 3 a 2 lu2 ( B 3^” 4 C 1 C 2 Q COS ^ + — c 2 2Q2sin (51.4) 

from Eq . (150) of V1966. Like Eqs.(lO) these equations contain no 
singularities , even for a polar orbit. Moreover they hold for all 
oribts . 

For an exactly polar orbit we have S=1 , P=0, Q=1 « ct 3 ~0, and 

The X and Y equations then become 


X + 


2 2 2 

iY = (o +c ) cos 'll exp i0^ 


(52) 


as for the case J 3 =0 of Eq . (11). The Z equation, however, is 

z = 0T1-6, where 6 = (r e /2) j' 1 1 J3 1 . so that the orbit is still changed 
by the J 3 . 

7. Velocity Components, with Accounted for 


From Eqs. (50.1) and (50.2) 

1 1 

2 2 

X+iY=(o 2 +c 2 ) [H 1 cos^+iH~ 1 (1-S) sgna 3 (H 2 +H 3 simi! ) ] exp iO* (53) 

Logarithmic differentiation of (53), with multiplication of the 
result by X+iY, gives 


1 I 


X+iY 55 ( 22 
p +c 

so that 


iL+iQ.) (X+iY) + (o 2 +c 2 ) 2 [-H sin^+iH^d-S) sgna^cos* ] *ex P if 
° (54) 




X= 


2 2 
p +c 


— 

- Y r ' + (D 2 + C 2 ) 2 t -H 1 sinicos r ' “H ^ 1 ( 1 -S) 2 sgna 3 H 3 cosisinC ’ ] i|| 


(54.1 
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l 1 

Y= 2^2' Y^XQ'4 ( p^c^) 2 [ -H^ sin^sinQ ' ( 1 — S ) ^sgna^H^cos^cosfl 
p +c 

Also 

i = np + 

by (15). Eqs . (16) and (23) still hold, so that the equations 

p and v are as for J 3 =0. 

For U' we find from (51,4) 

- 1 4 

Q*=--c 2 a 3 (“20^ ) 2 (A 3 + t kA 3k .coskv)v 


-1 2 3 3 2 2 

+ a 3 a 2 u (B 3 + -■C 1 C 2 Qsmif+ ie C 2 Q cos 2 tM 


The new expression for \jr is still lacking. From p. 14 of 
Bonavito 1966, we find 


T] = Q cosf f = ~22 2 
P +c r\ 


up o¥, „ 

rP 


Ti-C 2 T ) ) cos ‘I 


so that 


r 


1 

2 v 2 


V = 


(l-CjTl-Cjll ) 

2 2 2" 
p +c n 


Here 


- = 1+ — - (l-S)- 
u a O p O 


( 2 iJ 


(1-S) (1- 


a nP 


S) 


1+ ~T“ (1-2S) ] 


°*r- + o(j 2 4 ). 


a o p o 


* ^ „ -L ^ J. ~ ^ 

c = c 0 ' 6 = r e J 2 " 4 r e J 3 J 2 


h 

(54.2) 

(54.3) 
for 

(55) 

(56) 

(57) 

(58) 

(59) 
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The equations of this section reduce to those of Section 5, if 
is equated to zero. 

8. The Improved Algorithm for the Spheroidal Model, with 

Begin with Section 12 of V1966 and follow it through the 
third line on p.45, viz, n = p+Q sin 'l' . Instead of then calcu- 
lating E 2 ' and E3 ' , however, replace that calculation with Eqs . 
(42), (50), and (51) of the present paper. This changed pro- 

cedure not only simplifies the calculation of X, Y, and z for 

near-polar orbits but bypasses the right ascension in all cases. 

♦ • * 

To calculate the velocities X, Y, and Z, use Eqs. (54) through 
(59) of the present paper. 
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Appendix I 


Algorithm for Satellite Position Vector and Velocity, 
if the Potential V = -uo ( p^+c^t^ )- l( _ qj 


Given 


r e * J 2 a ' e ' Zl ^1' ^2' ^3 
Compute once for each orbit : 

c Q 2 =r e 2 J 2 , ri 0 =sinl, p=a(l-e 2 ). D=(ap-c Q 2 ) (ap-CQ 2 ^ 2 ) +4a 2 c 0 2 ti Q 2 
D'=D44a 2 c 0 2 (l-n 0 2 ) , A=-2ac 0 2 D _1 (l-Ti 0 2 ) (ap-c Q 2 Ti 0 2 )<0, B=c 0 2 ti 0 2 d -1 d’ 


b,» - ^A>0, b,=B 2 , a n =a+b 1 >a, p Q = -c 0 2 a 0 1 (l-n 0 1 )+aa 0 1 pD 1 D* 


i 

2 _ 2 


a 2 =(up Q ) . a 3 =a 2 |l 

“2’ = a 2< 1+ 


2 2 \ 

jcosl. TV 

a o p o j 


2 C 0 D -1 
2 “ apD' ' q= ^2 



2 . 
cos I) 

a o p o 

: n (x) 

n_ . 
* x P n 


compute 


Al= (1 -e 2)2 pJ 2 (^ P^) R n . 2 l U- 2 ) 1 ] 


1 1 

A_=(l-e 2 ) 2 p _1 7. (b 2 /p) n P n (b 1 A 2 )R n t (1-e 2 ) 2 ] . where 
n=0 

°2i = * (-l) i ' n (c 0 /p)2i " 2n(b 2 /p)2np 2n (b l /b 2 ) 

n=0 
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D 2i + 1= * <- 1 > 1_n < c (/P>' <Ve)' *2n + l‘ b l /b 2> 


n=0 


A,= (l-e 2 )V 3 E D m R m+2 [(l-e 2 ) 2 ] 


m=0 


1 _ 2 l 3 2 15 4 175 6 

3 X = 2 n q [K(q)-E(q)]= 2 + 16 q + q ' 


128 M 2048 


q +. 


B 2 = 2n 1 K(q) = 1+ |q 2 + q 4 ^ §fg 


a 0 ' = a O'" A l +C 0 2T 'o 2A 2 B l B 2 1 


( 2 m): m ; 1 ( 2 n):r l 0 


2n 


2 2m (m : ) 2 n=l 2 2n (n I ) 2 


- 1 

B 3 =1-(1- - =, Vb 2m 

m=z 


A 1X = | (l-e 2 ) 2 p' 3 e(-2b 1 b 2 2 p+b 2 ) A 12 = p~ (1-e ) b 2 e 

A 21 =(l-e 2 ) 2 p" 1 e[b 1 p _1 + (3b 1 2 -b 2 2 )p 2 - j b ]L b 2 2 (l+~)p 3 + | b 2 4 (4+3e 2 )p 4 ] 


A 22 =(l-e 2 )V 1 [^(3b 1 2 -b 2 2 )p- 2 - |e 2 b 1 b 2 2 p- 3 + ^ b/ (6e 2 + e 4 )p- 4 ] 


A 23 =(l-e 2 ) 2 p 1 |- (-b lb 2 ' V J +b 2 V“> . A 2 4 = 256 (l- e ‘)‘p "V e ‘ 


2-3 ,, 4_-4 


2 . 2 -5, 4 4 


A 31 =(l-e 2 ) 2 p" 3 e[ 2+b lP ” x (3+ -p'* (4+3e‘) j 


- 1 „. 3 2 , - 2 , 1 . 2 2 , 
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;Sj + fb.p-V-p " 2 (f- - |- 2 ) <^ 2 2 -c 0 2 )] 

L -1 1 -2,1 

12 b l P "F ( 2 U 2 ^0 ”34" 32' 


,, 2,2 -3 

A 32 =U-e ) P 

1 I 

A 33 =(l-e 2 ) 2 o 3 e 3 [-^b 1 p 1 -^P 2 (2 b 2 2+c O A 34 = “ 32* 1_e * p e *2 b 2 +C 0 * 


2Trv 1 =a 


‘■‘ft) 


-1 


2rrv 2 =a^ ^ a 2 * * e '=a rt A ae < e 


-1 

a O 


X l* P l- C 0 2 ^2 a 2 lT '0 2B l B 2 1 ' X 2 =f3 l +P 2 a 2 1 < a O +A l )J ^ 


1 

, _ /_y\ 2 _ .. .-i . - - -i- ■- 2 - 2 - - - -i 

3 ' I* O 


a 2 <A 2 B 2 ' X 4 = 3 0 (A 1 +C 0 ^0 A 2 B 1 B 2 ) 

1 _1 

= C 0 2 (i^) = (ijf) a 2 , ®2 1 ' X 7 = 8 q2 ®2 1 


For each point at time t , now compute 


1) 

M =2 ttv 1 (t+X, ) 
s i 1 

lit * 2TTV n (t+\ 0 ) 

s z z 


2) 

Solve for Eq : 

M s + V e ' sin(M s +E 0 )=M s 


3) 

To find v Q : 

Place E=M s +Eq in the anomaly 

connections 


cos v = (1 
and solve 

-e cos E) -1 (cos E-e) , sin v 
for v = M +v Q 

= (1-e cos E) 


4 > *0 = X 3 V 0 

5) Compute = -X^Vq+ 4 X 5 s i n ( 2 ♦ s +2 ♦ Q ) 
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6) Then E^f 1-e 'cos (M g +E Q ) |e • [ 1-e • cos (M s +E Q ) ] ^M^sin (M s +E Q ) 

7) Place E=M s *E 0 rE 1 in the anomaly connections and solve for 

v=M s +v 0 4v 1 

2 

8) Then | 1 = x 6 f A 2 v l + k L ’ 1 A 2k sin (kM s -*-kv 0 ) ] +K ? sin (2\f s +2^ Q ) 

9) Compute 

-1 2 2 

M 2 =- a 0 [A l V l + k ^ 1 A llc sln(kM . +ltv 0> +X 5{B 1 t 1 - | V OS < 2 t s + 2t 0 )- f «in(2| a +2t 0 ) 


+ sin'(4^+4* 0 )|] 

10) Then E 2 «[ 1-e • cos (M s +E 0 ^-E 1 ) ] ~ X M 2 

11) Place e=m s +e 0 +e 1 + E 2 in the anomaly connections to find v=M s +v 0 +v 1 +v 2 . 

12) Then t 2 =X 6^ A 2 V 2" A 21 V l cos (M s +V 0 )+2A 22 V l GOS (2M s +2v 0 ) 

+A 23 sin ^ 3M s +3v 0 )+A 24 sin ( 4M s + 4v 0 ) ]+2^ ? [ ^cos (2 V & -^2 Hr Q ) 

3a 2 o .2 

+ ~f~ 3in(2| s 4-2t 0 )- --gj sin (4^+4^)] 

Then 


E=M s +E 0 +E 1 + E 2' v=M s +v o +V 1 +V 2' t=ts + 1'0 + Tl + t 2 


13) p— a (1-e cos E)=(l+e cos v) ^p, nsT^^sin ^ 


14) fi'=p 3+ a 3 a^ 1 (B 3l f+ " (A^ 2 A^sin kv) 


3 _ 2 -4 


Then the rectangular coordinates are given by 
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16) 

17) 

18) 

19) 

20 ) 
21 ) 

Then 

22 ) 

23) 

24) 


2 2 2 

{=(0 +c Q ) (sin O' cos Ilf 4- cos O' cos I sin *) 


l = pin 


To find the velocity components, compute 


‘ _ a 
“ o 


■^( 1-e 2 ) 


(n 2 -fAo+B) 2 
2 2 2 
P +C 0 ^ 


; e 2 . 

3 a — P smv V 
p 


02 ' (l-q 2 sin 2 ^f) 2 

2 2 2 
D +C Q t 


V=a 3 a^ 1 ( B 3 + yf n 0 2 n" 4 cos 2t)» 




kA^cos kv)v 


0 0 * 2 2 2 • 
~~ 2 ~ 2 ~ X-Y0'+ ( p +c Q ) (-sinicosQ' -cos I cos^sinO ' ) 

0 +c Q 


f= 


-££_ 


2 2 
P +c Q 


Y+X0*+ (0 2 +c 0 2 ) 2 (-sin* sinO'+cos I cos t cos 0')i 


5 = TID+TIq d cos f ty 
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Appendix II 

Algorithm for Satellite Position Vector and Velocity, 

2 2 2 -1 

if the Potential V = -u(o+r:6)(p +c r| ) (j ^0) 


Given u, , a # e # S , ^ ' ^3 

Compute once for each orbit 

|J 


3 ' 2 „ 2 ,2 2 . 
0 -e u 2 * 2 -e J— • c 0 ~ 6 * p=a(1 - e > 


6= i r 


2 2 

-2ac 2 (ap-c 2 S ) (1-S ) + ***? C * 2 


A = 


** | l + ~ ( 3 S - 2 )} s ( l - S ) 


(ap-c 2 ) (ap-c 2 S)+4a 2 c 2 S+ 6 2 (3ap-4a 2 -c 2 ) S (1-S) 


B = c 2 +{ 2 a) ^(ap-c 2 )A, b^= - ~ A , a^=a+b^, b^^B 

1 

-1 2 2 
P 0 =a Q (B+ap-2Aa-c ), Oj-tupg) , u from 

' ( 1 - S ) 1 1 £ — - s ) 

V p o / a o p o / 


u _1 =l+ t 5 — (1-S) + 
a 0 p 0 


2 ~ ' C 2 = rV u > a 3 = ± a 2 (l“Su X ) 

U+iJr < 1 - 2S > ]2 0 0 + for drrect orbl 

- for retrograde 


2 2 2 
C. = (1- SU)' 1 ^ u(l- -S— u). p=(l- -£ — SU)' 1 ^ u (1-S) , 

1 a (r 0 Pn a nPn a nPp 


3 CFO 


a 0^0 


With R n (x)=x n p n (x 1 ) , compute 


1 1 
A 1 =(l-e 2 ) 2 P n nb 2 /p) n P n (b 1 /b 2 )R n _ 2 [ (1-e 2 ) 2 ] 


1 I 

A,= (l-e 2 ) 2 p _1 “ (b_/p) n P n (b A 2 ) R _[( l - e 2 ) 2 ] 
2 n=0 2 x 
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i 1 

A->=(l-e 2 ) 2 p 3 Z D R [(1-e 2 ) 2 ], where 

a ni iii j £ 

m=0 


° 2 i = E (_1)1 n (c/p) 2 l_ 2 n (b,/p) 2 n p,„(b 1 /b 0 ) 


n =0 


2n 1 2 


° 2 i + l= n f Q < "I > 1 * * * * * * * ' n <=/P) 21 ' 2n W < b i A 2 ) 


* 11 = 1 (l-e 2 ) 2 p‘ 3 e(- 2 b 1 b 2 2 p+b 2 4 ) A 12 = 3 I p" 3 (l-e 2 ) 2 b 2 4 e 2 


* 21 = 


(l-e 2 ) 2 p 1 e[b 1 p" 1 +(3b 1 2 -b 2 2 )p 2 - | b ^ 2 ( 1 + | )p 3 + |b 2 4 (4+3e 2 )p 4 ] 


A 22 =( 1 _e 2 ) 2 p" 1[ l ( 3 b l 2 - b 2 2 ) P 2 - I e 2 V> 2 V 3+ jf b 2 4 ( 6 e 2 +e 4 )p 4 ] 


,2. 2 -1 e , , . 2_-3 4 -4 


2 \ 2-5, 4_4 


*23= (1_e > P | <- b l b 2 p + b 2 P ). *24= 256 <!"• > 


1 

A 31 =(l-e 2 ) 2 p _ 3 e[2+b 1 p _ 1 (3+ | e 2 )-p 2 (| b 2 2 +c 2 ) (4+3e 2 )] 

A 32 =(l-e 2 ) 2 p- 3 [f 2 + l b^V-p 2 ^* | e 2 )(K 2 +c 2 )] 


1 4 4 1 


4 2' '2 2 


A 33 =(l-e 2 ) 2 p 3 e 3 [i 2 b 1 p 1 -i p 2 (| b 2 2 +c 2 )]. A 34 = - 5* <l-e 2 ) V* (^c 2 ) 


in -5,k 2 2,4 


Q = (P 2 + S) 1/2 


B 2=l- § C lP+ (§ Cl 2 + | C 2 )(i Q 2 )- fj C 2 2 Q 4 +0(J 2 3 ) 


. 1 ? 9 ^ 9 -> O/tlCOC, T 

B 1 = 2 Q +P ~“ 4 C l PQ " + i4 (4C 2 +3C 1 ‘) Q*+ iff C 2 ‘Q +0(J 2 -) 
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B 3 2°2 8 C 1 8 C 2 (1+ 2 q2 )+0(J 2 3 ), 


1 „ 3 


Q . 1 „ _4> 


B ll- - 2P3+ 8 C 1 Q ' B 12= - ( 4 + 8 C 2° >' B 13= - C 1 24' B 14= C 2 64 


B 21 = - C 2 PQ+ 16 C 1 C 2 Q + 2 C X Q ' B 22 = " 3 ^ ( 4 ^+ 30 ^ ) Q 2 + 3 C 2 2 Q 4 ] 


B 2 3 16 C 1 C 2° ' B 24~ 256 C 2~ Q ' r=2 (l"C 2 S) 

a 0' =a 0 +A l +c2A 2 B l‘ B 2 1 ' 2ttv 1 = (^) ( a 0 ,)_1 . 2nv 2 =a 2 u ^b' 1 (a^ )"\ 
e ’ =a e a . 1 


3 ^ 2 4 


>- 1-1 


-1 

\2 - ± 


X 1 0 1 C ^ 2 a 2 B l' B 2 1 ' X 2 = 9 l +S 2 a 2 1 ( a 0 +A l )A 2 ^ X 3 = [*tf a 2 u 2 ft 2 B 1 


X 4 = a 0 1 < A 1 + c 2 A 2 B 1 


* B 2 1) ' X 5= (a~j c2q 2 1u2 ' X 6*^ 2 “2 U V' 


ii 


H x *2 2 [1+S+(1-S) (1-r 2 * 2 ) 2 ] H 2 = I q[ ( l-r6) 2 -(l+ r 6) 2 ] , 


h 3 * j [ (1+P) (1-r 6 ) 2 + ( l-p ) (l+r 6) 



Compute 

for each point 

1) 

M s =2TTV l (t+X i ) 

, » s =2nv 2 (t+X 2 ) 

2) 

Solve for Eq : 

M +E -e * sin (M +E_,)=M 
s U s 0 s 

3) 

To find v • 

Place E=M_+E /X in the anomaly connections 

4) 

V 

*0 =X 3 V 0 

s u 
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5 ) Compute Mj= -X 4 v Q - 1 2 sin ( 2 ij( g + 2 ^ 0 ) 

-1 

6 ) Then E 3 =[ 1-e 1 cos (M s +E Q ) ] -|e * [ 1-e • cos (M S +E Q ) ] “ 3 M n 2 sin (M s +E Q ) 

7 ) Place E=M_+E^+E, in the anomaly connections and solve for v=M +v +v 

s u 1 s 0 1 


8 ) Then t 1 =X 6 [fl 2 v i + ^ i A 2 |t sin (kM s +kv Q ) ] -B^B^cos (Vt 0 > ' B 22 B 2 1 ‘ 

sin(2t s +2i(r 0 ) 

-1 2 l 

9) Compute M 2 =-a Q [ A i v i+ £ A i k s i n (kM s +kv Q ) +X5 | B i "ti +B n cos ( 1 f s + to ) 

+2B 12tl COs(2 'i's +2 To ,+B 13 cos(3l l , s +3 to )+B 14 sin(4 ts +4 'fo ) } 1 

10) Then E 2 =[l-e , cos(M s +E 0 +E 1 )] _1 M 2 

11) Place E=M s +EQ+E^*tE2 in the anomaly connections to find 

V=M S +V 0 +V 1 +V 2 

12) Then if 2 =X 6 [ A 2 V 2 +A 21 V l COS ( M s +v o) + 2A 22 v 1 cos (2M s +2v Q ) 

+A 23 sin (3M s +3v 0 )+A 24 sin < 4M s +4v o ) J “ B 2 1 ^ ~ B 21^1 sin ( t s + 'lo ) 

+2B 22h COS (2 ts +2 to> +B 23 COS (3 ts +3 to> +B 24 sin (4 t. +4 t 0 ) 1 
Then E=M S +E 0 +E 1 + E 2 , v=M s +v 0 + v 1 + v 2 . t=t s +t 0 +| 1 +! 2 

13) p=a (1-cos E) = (l+e cos v) 3 p, rj=p+Q sin i" 


14) n'=p 3 -c' 


- 1 
2 




4 I 

-12 3 

(A^v+ S A^^sin kvj+a^a^ u (B^ - ^C-j^Qcos^ 


+ 32 C 2 2Q 2 s in 2y ) 
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Then if sgn a^= +1 fcr direct or retrograde orbits respectively, 
the rectangular coordinates are 

1 1 

15) X=(o 2 +c 2 ) 2 [h 1 cos 0 ' cos’^-H^ (1-S) 2 sgna 3 (H^H^sinv ) sin H'] 

1 I 

16) Y= (p 2 +c 2 ) 2 [H 1 sinQ'cos\li+H 1 1 (1-S) 2 sgna 3 (H 2 +H 3 sinHO cosP' ] 


17) Z=p*n- 6 


Velocity Components 


18) v= 


2 2 2 2 
Cl ( P +AP 4 -B) _ 
2 2 

P +c h 


, * e 2 . 

19) p = p D sinv v 


^0 ,2 22 


) * = J (p +c n ) (l+ Cl n-c 2 h ) 


2 / \2 4 

21 ) O'- -c aJ^-| (A~ + I kA~.cos kv)v 

J \ a 0 / J k=l JKL 


+ a 3 a 2 1 u 2 (B 3 + sini<+ C 2 2 Q 2 cos 2\|l)i 
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1 I 

22) X= - 2 - P - P 2 X-Yn , + (p 2 +c 2 ) 2 [-H 1 sin4cosO , -H“ 1 (l-S) 2 sgna 3 H 3 cos4sinn , ]i 

P + C 

1 1 

23) Y- 2 P - P 2 Y+XO' + (p 2 +c 2 ) 2 [ -H 1 sin\lisinO , +H 1 1 (l-S) 2 sgna 3 H 3 cosil'cosO' ] i 

p +c 

24) Z= tip + pQcos ^ \|i 
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EXPANSION FORMULAE IN CANONICAL TRANSFORMATIONS 
DEPENDING ON A SMALL PARAMETER 


by 

Ahmed Aly Kamel 

Stanford University, Stanford, California 
ABSTRACT 

The theory of perturbation based on Lie transforms 
is considered. Deprit’s equation is reduced to a 
form which enables us to generate simplified general 
recursion formulae. These expansions are then modi- 
fied to speed up the implementation of such pertur- 
bation theory in the computerized symoblic manipula- 
tion . 


1. INTRODUCTION 

If a system is described by a Hamiltonian depending on a small para- 
meter, then canonical transformation can sometimes be obtained using a 
von Zeipel generating function (See for instance Brouwer and Clemence 1961) . 
In such a case, the transformat ion is implicit because the generating 
function is in mixed variables (the old coordinates and the new momenta). 

The shortcomings of von Zeipel* s method, when the generating func- 
tion itself depends on a small parameter, were felt by Breakwell and 
Pringle (1966), and Deprit (1966), when they used a von Zeipel generat- 
ing function to remove the short period terms from the Hamiltonian of 
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a particle in the neighborhood of the triangular points in the restricted 
problem of three bodies. Breakwell (See Schechter 1968) recognized, 
after comparing with Deprit et al (1967), that the long period part of 
the second order Hamiltonian derived in mixed variables was misleading, 
and that it was possible to obtain a different representation in terms 
of new variables only. Using this suggestion, Schechter (1968) obtained 
a more valid second order expression. Deprit (1968) attacked the prob- 
lem using Lie transforms and extended the expansion to include higher 
orders. In this paper Deprit' s recursive algorithm is reduced to a form 
which enables us to generate simplified and modified general formulae 
(Section 3 and Section 4), 


2^ BACKGROUND 

A Lie transform may be defined by the differential equations 


— = W x (x,X,t; € ) (2.1a) 

^ = -W (x,X,t; € ) (2.1b) 

d e x 


d t 
de 


0 


(2.1c) 


— - = W (x ,X, t; e) 
d g t 


(2. Id) 


with the initial conditions x = y, X - Y, t = t, and F = 0 at g = 0. 
The foregoing equations define a canonical transformation. This can be 
shown as follows: for any g, the differentials dx , dX, 5x , SX, and 


\ 
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5F produced by the initial changes dy , dY, dt, 5y, and 5Y satisfy 
* 

the equation 

~ [dx.5X - dX. 5x - dt 5 f] = O . (2.2) 

Hence, dx.SX - dX.Sx - dt&F is independent of € and equals its value 
at € = 0, so that for F = H(x,X,t;e) - K(y,Y,t;e) 

x. 5X - X. 5x - 5H = y. 5Y - Y. 5y - SK. (2.3) 

Therefore, if x and X satisfy the canonical equations 

x = H., X = -H , (2.4) 

X x 

then, also y and Y satisfy the canonical equations 


y 


Y = -K 

y 


(2.5) 


Now, take any indefinitely differentiable function f(x,X,t;e) that can 
be expressed in terms of x,X,t and e as a power series in e, in 
the form 


f (x , X , t ; e) 


a> n |\n 1 co n 

Y “T — f(x,X,t;e) = Y -S. 

4 c=° ^ n n: 

n=0 J n=0 


f (x,X,t) 


( 2 . 6 ) 


Notice that 


-dx = dW x = W Xx .dx + W xx .dX + W xt dt , 


*-^5x — 5W — W . fix + W . fiX . etc . 
ae X Xx XX ' 
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then, interns of y,Y,t, and e as a power series in e, it takes the form 


f (x,X, t; e) 


where 


oo n T n "1 oo n 

2 * 1 h ,<■> 

n=0 L J £=0 n=0 


(y.Y.t) 


f (x , X , t) = f (x , X , t ; e)l 

n U e n J 


, n > O; 


e=0 


( 2 . 


( 2 . 


df 
d G 


i . \ & HP dx _ dX 

(x,X,t; e) = -T- + f + f„.— , 

de x de X de 


( 2 . 


and 


Notice that 




f^ n ^(y,Y,t) = J~ f(x,X,t;e) , n > O . 

e=0 


( 2 . 


f (x,X,t) = f(x,X,t;0) , and f (0) (y,Y,t) = f(y,Y,t;0) . 


Using Eqs. (2.1a) and (2.1b), Eq. (2.9a) can be written as 

df df , ^ 

di = "Si + Sr f 


( 2 . 


where L^ is a linear operator called Lie derivative generated by Vi 
and is defined by 


\ f = «;w> = f x .* x - f x -v 


( 2 . 


Taking f = x,X, and F in Eq. (2.7), and using Eqs. (2.1a), (2.1b) 
and (2. Id), one obtains the following 


x = y + 


J fr7 (n VY.t) 

n=l 
» n 

£ £y<»>( yi y ft ) 


( 2 . 


( 2 . 


,7) 

8 ) 

, 9a) 
9b) 

10 ) 

J t 

ID 

12a) 

12b) 
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H - K ' j £ R( 


(y,Y,t) , 


where, for n > 1 we have 


n-1 W x) 
d e /e=0 


7 W 

n-l x / ^ 

,d€ /e=0 


^T w t 


In particular, for a generating function W of the form 


W(x,X,t;e) = ^ ft W n+1 (x,X,t) * 


and f(x,X,t;e) of the form given by Eq. (2.6), Eq. (2.10) yields 


de n! n 


f (1) (x,X,t) = f 


I C " 

Z-. m 


L i f 

m+1 n-m 


c n = n: 

m (n-m)!m! * 
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and 

L f = (f;W ) , p > 1 . 

P P 

In general, for k > 1, n > 0, one obtains 


with 



P (k) 


(x , X , t ) 


f (k) 


n 


(x,X, t) 


f (k-l) 

n+1 


♦ y c n 

0<m<n 


f (k-1 ) 
n-m 


Now, letting e = 0 in the above equation we get the following, 
the remainder of this paper, this equation will be referred to as 
prit's equation.) 


where 


P (k) 


n 


(y.v.t) 


f (k-l) 

n+1 


y C " 

Z— m 
CKm<n 


m+1 


f (k-1) 
n-m 


L f 


P 


(f ; W ) = f .w 
p y py 


f v .w 

Y py 


p > 1 . 


Notice that 


„( 0 ) 


(y,Y,t) = f (y , Y, t) , 


and f^ k) (y,Y,t) = l (k) (y,Y,t) 


and 


Deprit's equation, together with the functions 
Y ^ ^ can be best visualized from the triangles 


H (n) , R (n) 
of Fig. 1 


, y 

and 


(2.17b) 

(2.18) 

(2.19) 

(For 

De- 

( 2 . 20 ) 
( 2 . 21 ) 


(n) 

Fig. 2. 
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FIG. 1. RECURSIVE TRANSFORMATION OF AN 
ANALYTIC FUNCTION UNDER A LIE TRANSFORM. 
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Finally, the inverse transformation can be written as 

oo n , 

y = x + ^ x n ^(x,X,t) (2.22a) 

n=l 

°° n ( ) 

Y = X + ^ — X ln; (x,X,t) . (2.22b) 

n=l 

To find the relation between the x^ n ^'s and y^ n ^'s, X^^'s and Y^ n ^'s 
one may eliminate x-y and X-Y between Eqs. (2.12a), (2.12b), and 
(2.22), and define the functions q(x,X,t;e) and Q(x,X,t;e) as follows: 

00 n 00 n 

q(x,X,t;e) = ^ FT x(n) < x . x - t > = - ^ ^7 y (n) (y,Y,t) (2.23a) 

n=l n-l 


Q (x , X , t ; e) = V p- X (n) (x,X,t) = - ^ Y (n) (y,Y,t) (2.23b) 

n=l n-l 

Therefore, for n > 1 we have 

<* n — x (x ,X , t ) , q^ =- y^(y,Y,t) , (2.24a) 

Q n = X (n) (x,X,t), Q (n) = - Y (n) (y,Y,t) . (2.24b) 


SIMPLIFIED GENERAL EXPANSIONS 

Given the functions f . f „ , . . , and f , Deprit (1968) construc- 
n n- 1 0 

ted the required functions f^ n and f^^ by introducing 

(k) 

the auxiliary functions f^ and by moving recursively from the left 
diagonal of Fig. 1 towards the right diagonal. One might as well 
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construct the function only in terms of f and f* n 1 \ f^ n 

f^°^ or f in terms of f^ n \ f^ n and f^°^ (which will 

n 

be useful in the construction of the inverse transformation) by intro- 
ducing a suitable linear operator. To show how this can be done, let 
us write Depri^s equation as 


■ 00 


f (k+D 
n-l 


I 

m=0 


n 1 


pOO 


m+1 n-m-1' — 


n > 1 , k > 0 


(3.1) 


By successive elimination of the functions on the right hand side of 
the above equation one would eventually obtain f^^ in terms of f^ + \ 


f (k+n-l) f (k) 


Thus, one may assume the following form for 


,00 


f (k) _ f (k+n) 
n 


11 

I 

j=l 


C .G f 
J 3 


(k+n-j ) 


n > 1 , k > 0 


(3.2) 


where G is a linear operator and is a function of L., L. and 

j 3 j-1 

L . Substitution of Eq. (3.2) into Eq. (3.1) yields the following re- 
cursion relation for the linear operator G^ 


G . - L . - 
3 3 


I 


C J “ 1 LG. 1 < j < n 

m m+1 j-m-1 — — 


<Xm<j-2 


(3.3) 


For example 


G 1 =L 1 


(3.4a) 


°2 = L 2 - L 1 L 1 


G 3 = L 3 ’ L 1 (L 2 - L lV - 2L 2 L 1 


(3.4b) 

(3.4c) 
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Using Eq. (3.2) with f = y and Y, and taking k = 1 , we obtain the fol- 
lowing general recursive relations for and of Eqs. (2.12a) 
and (2.12b) 


(n) w 

y = w „ v + 


y C ”- 1 G. y (n -J>, n 
A J J 


> 1 , 


(3.5a) 


l<j<n-l 


(n) 

Y = - W + 

ny 


y c"- 1 o. 

Lu j j 

l<j<n-l 


n-1 (n-j) 

Y v J , n > 1 . 


(3.5b) 


Using Eq. (3.2) with f = q and Q of Eqs. (2.23a) and (2.23b), and 

taking k = 0, we obtain the following general formulae for x^ n ^ and 

„(n) 


(n) (n) 

' = -y + 


y C" G. y (n - j) , n > 1 

Z-v J J 


(3.6a) 


l<j<n-l 


x (n) = -Y (n) + y C n G Y (n - J) , n > 1 (3.6b) 

3 3 

Kj<n-1 


Now, x^ n) (x,X,t) and X (n) (x,X,t) of Eqs. (2.22a) and (2.22b) are 
simply given by 


x (x ,X , t) = J 

[„<■>] 

X (n) (x,X,t) = 



(3.7a) 

(3.7b) 


Next, consider an indefinitely differentiable function v(x,X,t) 
not explicitly dependent on e. Using Eqs. (2.6), (2.7), and (3.2) with 


213 


EXPANSION FORMULAE IN CANONICAL TRANSFORMATIONS 

f = v(x,X,t), f =0 for n > 1, and k = 0, we obtain the following 
0 n — 

general formula 

00 n / N 

v(x,X,t) = ^ -p- v tn, (y,Y,t) (3.8a) 

n-0 

v (n) (y,Y,t) = y c" G. v <n ~ J) (y,Y,t) , n > 1 (3.8b) 

J J 

!<j<n 

where 

,Y,t) = v(y,Y,t) . (3.8c) 

Also the inverse relation can be written as 

v(y,Y,t) = v(x,X,t) + ^ Z- V n (x,X,t) , (3.9a) 

n=l 

elimination of v(y,Y,t) - v(x,X,t) between Eqs. (3.8a) and (3.9a), and 
using Eqs. (2.6), (2.7), (3.2) with k = 0, and (3.8b) leads to 

V^ n \x,X,t) = - |g v(y,Y,t)l y = x . (3.9b) 

L n J Y = X 

Lastly, given the Hamiltonian H(x,X,t;e) in the form 

oo n 

H(x,X,t;e) = V (x , X , t ) (3.10) 

Z n! n 

n=0 

one can construct the transformed Hamiltonian K(y,Y,t;e) in the form 

oo n 

K(y , Y, t ; e) = V Z- K (y,Y,t) (3.11) 

Z- n - n 

n=0 
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and this can be done as follows. Using Eq. (2.7), H can be written as 

oo n 

H(x ,X, t; e) = ^ ^7 H (n) (y,Y,t) . (3.12) 

n=0 

Combination of Eq. (3.12) with Eq. (2.12c) and Eq. (3.11) yields 


K = H 
O 0 


K = H (n) + R (n) , n > 1 . 


Setting k = 1 and f = H + R in Eq. (3.2) leads to 


H (1) + R (1) = K-Vc n GK , , n > 1 

n n n+1 £ j j n-j+1 


j=l 


But from the R and H triangles of Fig. 2, we have 




H (1) = H + 
n n+1 


Ii 


L H , n > O . 
m m+1 n-m — 


m=0 


(3.13a) 

(3.13b) 


(3.14) 


(3.15) 


(3.16) 


Therefore, the simplified general recursive relation of the transformed 
Hamiltonian is given by 


K 


0 


(3.17a) 


K 


n 


H + 
n 


I (Si 


L 

J 


H 

n-j 


+ 


C 


n-1 

3 


G 

J 



DW 

n 

~Dt" 


n 


> 1 


(3.17b) 
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where 


For example 


DW d w 


— ~ T — “ Lj 11 ^ 

Dt dt n 0 


- L H , n > 1 


(3.18) 


K = H 

_ 

DW. /Dt 


1 1 


1 



K = H 

+ 

L_ 

+ G., K - 

DW_ /Dt 

2 2 


1 1 

1 1 

2 

K = H 

+ 

L 

+ 2L 0 H 

+ 2G K + 

3 3 


1 2 

2 1 

1 2 

K = H, 

+ 

L H 

+ 3L H 

+ 3L H + 

4 4 


1 3 

2 2 

3 1 

operators 

V 

G 2 , and 

G 3 being 


(3.19a) 

(3.19b) 

(3.19c) 

t . 

(3. 19d) 


4. MODIFIED GENERAL EXPANSIONS 

In the simplified formulae obtained in Section 3, the rate of in- 
crease of the number of the Poisson brackets with respect to the order 
of perturbation can be reduced if one uses intermediate functions like 

f = G f or G f^ n ^ to be saved for later use in computation. Thus, 
j > n 3 n j 

this leads to the following recursive relationships: 

a) For y (n) , x (n) , Y (n) , and X (n) of Eqs. (3.5) and (3.6) 


y (n) = W 


l<j<n-l 


(n) (n) 

c = -y + 


y c "- 1 y. . 

3 J,n-J 

n-1 

y c n y. . 

Z- 3 J,n-J 


(4.1a) 


(4.1b) 


l<j<n-l 
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I C T ly j,n-J 

l<j<n-l 


Y (n) - -W + 

ny zL "j * j , n-j 


X <n) =-Y (n) + y C n Y. . 

J J,n-J 


y . . = L . y 
J.i J 


(i) 


y d - 1 

CKm<j-2 


L m+1 ^j-m-l,i 


Y . = L Y 
j ,1 J 


(i) 


y C J_1 L Y. 
Z-i m m+1 


CKm<j-2 


(b) For v ^ and of Eqs. (3.8b) and (3.9b) 


(n) 


V n n 

= / C v 

3 J » n-J 


l<j<n 


V (n) (x,X,t) = 


[ v _ I y = x 

n ’°J Y = X 


v . = L v 
J » 1 J 


(i) 


y c ^- 1 

Z-, m 


L . v . 

m+1 ; 


(Xm<j-2 


(c) For K of Eq. (3.17b) 
n 


K = H 
n n 


y fc^L.H + C n_1 K . 

\ J-l J n-j j J,n-jy Dt 


l<j<n-l 


where 


J ,1 


L.K. 
J i 



CKm<j - 2 


L . K. 
m+1 j-m-l,i 


(4.1c) 


(4. Id) 


(4.2a) 


(4.2b) 


(4.3a) 


(4.3b) 


(4.4) 


(4.5a) 


(4.5b) 
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Notice that for ^ = 0, K « O for all J’s. For example, 


K 2 = h 2 + L 1 H 1 + K 1(1 - DW 2 /Dt (4 ' 6a) 

K 3 = H 3 + L 1 H 2 + 2L 2 H 1 + 2K 1,2 + K 2,l - D V Dt (4 ’ 6b) 

K 4 = H 4 + L 1 H 3 + 3L 2 H 2 + 3L 3 H 1 + 3K 1,3 + 3K 2,2 + K 3,l ' °V Dt 


where 


K l,l 

= L 1 K 1 



K l,2 

= L 1 K 2' 

It 

1— l 
CM 

L 2 K 1 - L 1 K 1,1 

K l,3 

= W 

K 2,2 ■ 

L 2 K 2 - L 1 K 1,2 

K 3,l 

= L 3 K 1 

- L 1 K 2, 

1 - 2L 2 K 1,1 ’ 


(4.6c) 

(4.7a) 

(4.7b) 

(4.7c) 


The construction of the transformed Hamiltonian using the scheme pre- 
sented by Eqs . (4.5a), (4.5b), and (3.18) is simpler and requires less 
computer time and storage than the scheme presented by Deprit (1968). 

A considerable amount of this reduction is due to the fact that the sums 

H (n) + R (n) as well as H (1) + R (1) in Eqs. (3.13b) and (3.14) were 

n n 

considered as single quantities as if the transformed Hamiltonian was 

constructed from a single triangle whose end products are H^ ^ + R^ 

and whose starting elements are H^ 1 ^ + Further reduction in the 

n n 

computer requirements can be achieved if some of the K^’s vanish, in 
which case K also vanishes for all possible values of j. 

j » * 

Equations (3.17) or (4.5) and (3.18) are directly applicable to 
nonlinear resonant problems in which H^ is a function of only the 
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action variables X, while H n < n > D depend trigonometrically on the 
angle variables x and possibly the time t. It is desirable to trans- 
form to new variables so that the resulting Hamiltonian contains, to- 
gether with the new action variables Y, only certain slowly-varying 
"long-period' 1 combinations of the new angle variables y and the time 
t. Equation (3.17) or (4.5) may be used to define the w n * s successively 

so as to remove all "short-period' terms from the K * s; such a W is 

n n 

unique up to an arbitrary additive long-period term. 

The equations obtained are now being used in a fourth-order analysis 
of the motion (stability and periodic orbits) of a particle in the neigh- 
borhood of of the earth-moon system in the presence of the sun. In 

this problem, the following parameters are treated as first order small 
quantities: distance from L^/ear th-moon distance, eccentricity of 

the moon's orbit around the earth, moon mass/ear th mass, mean motion of 
the sun/mean motion of the moon. The additional parameter (earth-moon 
distance/earth-sun distance) is treated as a second order small quantity. 
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THE FORMAL SOLUTION OF THE n-BODY PROBLEM 


ABSTRACT: The power series solution of the equations of motion of 

a system of n point— masses is presented. This solution is a formal 
one in the time domain. The origin of the series expansions is a 
non-collision point. A procedure has been developed using three 
fundamental recursion formulas, one of which involves a special 
differential operator. Some of these analytical formulations have 
been programmed in the PL/I FORMAC language. Results are presented. 


Both authors are located at the IBM Cambridge Advanced Space Systems 
Department, FSD, Cambridge, Massachusetts. 
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1. Recent applications of the methods of celestial mechanics to prob- 
lems of space flights impose severe requirements upon the quality 
of the solution. Quality stands here for high level of accuracy 
in the computed position of a space probe when its motion takes 
place under the perturbations exerted by many bodies. Means to 
satisfy those requirements are offered by well-known numerical in- 
tegration techniques which can be applied to the equations of motion. 
Although efficient from a computational point of view, these tech- 
niques are regarded in general as being a rather crude approach to 
the solution of the problem. Besides, it might be desirable, from a 
theoretical point of view, that the solution be obtained in an analytical 
form, for instance that it be constructed as a time-power series. In 
this paper we show how this analytical goal can be achieved. We will 
give a recursive method to construct the terms of these series up to any 
desired high-order power of the independent variable (t). The formal 
solution thus obtained could be used to cover an arc of the trajectory 
much longer than the step used in any numerical integration procedure. 
This solution is valid, of course, in a neighborhood of t ■ 0, origin 

of the series expansion, which is assumed not to be a collision point. 

The crucial problem to be solved is to construct the series expansion 
of the inverse cube of the distance between bodies. This is achieved 
introducing some auxiliary functions and operating on them with an "ad- 
hoc" differential operator. The end result provides the coefficients 
of the series as functions of the initial conditions which must be sat- 
isfied by the equations of motion. 

2. We consider the motion of n bodies in a Newtonian potential field. 

Let m^(i ■ l,2,3,.*.n) be the masses of these bodies, to be considered 
as point-masses, and x^, y^, their Cartesian coordinates referred to 
an inertial reference frame. For the sake of generality, we suppose 
that none of the masses is negligible and that none of the bodies Is 
constrained to move along a prescribed trajectory. 
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The equations of motion can be written as follows 


(1) 

v 

1 j-i 3 

where 

-3 

(2) 

U ij " r ij 

(3) 

r2 u “ a2 u + b ' 

(4) 

a U " x j - V b u “ : 


(j 4 t) 


y j " y i’ c ij ' Z j “ V 


and similarly for y.^ and z ± . The symbol D stands for and k is 
the gravitational constant. If the fundamental units of length, 
mass and time are appropriately chosen, then we can take k ■ 1* 


The set of equations (1) constitutes a system of 3n differential equa- 
tions of the second order. 


Let now 
(5) 


x i0 ’ x i (0) > X il " (Dx i ) 0 


be the initial conditions. The formal solution of (1) as a Taylor 
series in t is 


( 6 ) 

where 


'i ’ l „ X iv tV 


v-0 


iv v! i 0 


The first two coefficients in (6) are given by (5). The third coeffi- 
cient x 12 is given by the right-hand side of equation (1), evaluated 
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at t ■ O and multiplied by one-half. Then, the successive coeffi- 
cients can be constructed by an iterative procedure. In fact, 
setting 


00 



we get from (1) by a well-known procedure 

1 n v 

(8) x i,v+2 " (v+2) (v+l)^_ l m j^_ 0 W ijp a ijv-p, 


This equation can be used recursively after we have learned how to 
compute ^ij v in term8 of the initial conditions (5). 

We put emphasis on the fact that there are -|n(n-l) functions y ^ which 
should be handled simultaneously due to the coupling of the subscripts 
i,j. This implies that the algebraic manipulations to be performed 
will be very lengthy, even for relatively small values of n. We will, 
however, omit from now on the subscripts i,j. The notation used by 
Sconzo [1] in his investigation on the tridimensional non-res tricted 
three-body problem will also be used here. 


3. We introduce the first of our auxiliary functions by the definition 


(9) s * aDa + bDb + cDc 
For compactness we rewrite s as follows 

(10) s - S [aDa] 

where the symbol S[...] has the meaning of a sum extended over terms 
in b and c similar to that in a. 


Then, successively differentiating the function y it is not difficult 
to recognize that 

(11) U - -tdV - - -^kpP + Q ) 

where is an expression in ct,e and the derivatives of s of order 
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higher than the second, and Q v a polynomial in a,e where p, a, and e 
are new auxiliary functions defined as follows 
_? 

(12) p - r , a - ps, e « pDs 

In order to prove (11) one has to observe that the first derivatives 
of all the auxiliary functions so far introduced can be expressed in 
terms of the functions themselves. In fact, it is 

2 

(13) Dy - -3ya, Dp - -2po, Do - e - 2o 

( 2 ) * 

( 14 ) De - -2ae + pD v 's 

In our recursive procedure the equations (8) and (11) are pivotal for- 
mulas, together with the formula obtained by applying Leibnitz rule to 
the right-hand side of equation (10). Distinguishing the cases where 

the order of differentiation is odd or even, the latter formula is, 

respectively, 

(15a) D (2v * 1) b - S[i( 2 £Jt 0> (wtl) a) 2 + £ 1 ( 2 £ 2 +p >U (V+1+p) sD ( ' H ' 1_p) a] 

(i5b> d< 2 ». . 

P-1 ^ 

We notice in passing that the functions y, p, s, a and e can be ex- 
plicitly expressed in terms of the initial conditions (5). The deriva- 
tives of s of order greater than the second become instead implicitly 
defined in terms of the same initial conditions by virtue of (15a) and 
(15b). 


[ 2 ] 

*We notice the analogy with the two body problem formulation . In this 

particular case there is only one function y and the auxiliary function s 

satisfies the differential equation 

D' 2 ^s - - ys 

Thus, equation (14) reduces to 

De - - a(2e + y) 

and the whole procedure is greatly simplified. 
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A. The method we have described can be considered completed if we 
succeed in giving explicit expressions for and for any 
desired value of v. 


To this end we consider the following operator 

(16) 0 - D - 5a 

Then, a simple algebraic manipulation provides 


(17) 


where 


P . - - 0P + A 

V+l V V 


Q , , » -3aQ + B 

v+l v v 


(18) 


A 


v 


fiQ 

fie 


d< 2 > 


S 


B 

V 


fiQ 

fia 


(e-2o 2 )- 2a 


fiQ 

V 

fie 


Thus, starting from 

p 0 ■ °- % — i 

any expression can be generated, by hand for lower indexed, by a 
computer for higher indexed functions and Q^. A program written 
in the PL/I FORMAC language has generated these functions, conse- 
quently the derivatives of y, up to orders far exceeding any practical 
need. We list in the table appended below the first six of these 
functions. For v > 6, the expressions become very lengthy, and this 
is the only reason which prevents their presentation in this paper. 


The problem of finding the formal solution (6) of the equations of 
type (1) can thus be considered solved. 
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TABLE OF THE FUNCTIONS AND 

v! v vPv 


P l-° 


Q x * O 


i_p 7 - 0 

Jr 2 


(e-5a 2 ) 


i_P .1„<2) 
3T3 6 8 


3^3 - | o (-3e+7a 2 ) 


i_p „ 1 n (3) 5 „ n (2) 

4!*4 24 D 8 “ 6 ° D s 


- - r + ° 2 (2e - 3 ° 2) 


1 r . 1 n (4) _5 „ n (3) 

51*5 120 D 8 " 24 D 


3jQ 5 “ 0 (5e 2 - 30o 2 e + 33aS 


- (e-7cr 2 ) D (2) s 


6! 6 720 


7?n d(5>s - 2l a d(4)s - 4 I (£ - 7<j2) d(3>s 61^6 “ fF - If ° 2 <«e 2 - 165a 2 E 


. 35 , - 2. -(2) 5 /r . (2) v 2 

+ 12 a ^ £ ” 3a ) D s " 72 p 


+ 143a ) 
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THE LONG PERIOD BEHAVIOR OF A CLOSE LUNAR ORBITER 
INCLUDING THE INDIRECT SOLAR GRAVITY PERTURBATION 


by 


Robert Dasenbrock* 


ABSTRACT 


The long period behavior of a lunar orbiter is considered. 
Of special interest are the effects due to the inclination of the 
apparent Earth's orbit about the Moon and those effects described by 
the laws of Cassini on the equations of motion. The first part of 
the paper is restricted to low orbits where the lunar gravity field 
dominates the terrestrial perturbation and to higher orbits of low 
inclinations where the argument of pericenter circulates through an 
angle of 360 degrees. The last part of the paper deals with near 
polar orbits where the indirect solar perturbation as described by 
the laws of Cassini is most important. Long-term stable positions 
for the orbit plane are found. 


* Doctoral Candidate and Research Assistant, Department of 

Aeronautics and Astronautics, Stanford University, Stanford, 
California (January 1969) 
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NOMENCLATURE 


a 


F 

F 0» F 1> F 2» F 3 

g 

G 

h 

h 

H 

X 


J 20’ J 22’ J 3’ J 4 

l 

L 

£ 

P 

Pi 


Pr 


r I 

r P 

? r 

"to 


semi-major axis 
eccentricity 

eccentricity of apparent Earth's orbit about 
the Moon 

the negative of the Hamiltonian 
components of F 
argument of perigee 
yjIaTI-e^), canonically conjugate to g 
angular momentum vector of satellite 
position of the ascending node 
^Aiad-e 2 ) cos i, canonically conjugate to h 
Hamiltonian 

component of the Hamiltonian 
inclination 

inclination of the Earth's orbit to the 
lunar equator = 6 degrees 44 min 

lunar gravity coefficients 
mean anomaly 

yjla, canonically conjugate to l 
Lagrangian 
a(l-e 2 ) 

momentum canonically conjugate to coordinate, 
rj, in inertial space 

momentum canonically conjugate to coordinate, 
r r , in the rotating frame of reference 

coordinate in inertial space 
perigee heigth = a(l-e) 

coordinate in the rotating frame of reference 
lunar radius 
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S 

s O» S l» S 2 
V(r) 

0 

M- 

U 

SI 


generating function 
components of S 
arbitrary potential function 
Earth coordinate 

momentum canonically conjugate to 
gravitational constant of the Moon 
argument of peri center = g 
position of the ascending node = h 


235 



CLOSE LUNAR ORBITER LONG PERIOD BEHAVIOR 


I. INTRODUCTION 


Some attention in recent years has been focused on the 

problem of determining the motion of a lunar satellite. The 

problem is complicated by the peculiar nature of the Moon's gravity 

12 3 

field. Early attempts by several authors ' ' on the solution of 
this problem were made by assuming the Moon to be in nearly hydro- 
static equilibrium. Thus only the J 2Q and J 22 gravity coefficients 
were carried in the equations of motion. The higher harmonics J 3> 

J^, etc., were either ignored or assumed to be of order 

Independent determinations of the gravity coefficients by both the 
4 5 

U.S. and the U.S.S.R. invalidate this assumption. It appears 

from the early data that the oblateness coefficient, J , has a 

-4 2 

value of approximately -2.0X10 . However preliminary data from 

Lunar Orbiters I through V still gives no conclusive evidence on 

the absolute values for the higher gravity coefficients. It 

-5 

appears at this time that these are all at most of order 10 

The lunar orbiter problem is further complicated by the 
large perturbation caused by the Earth. For an orbiter of moderate 
height, say 800 to 2000 km above the surface, the terrestrial per- 
turbation is roughly equal to the oblateness effect of the Moon’s 
gravity field. 

Of primary interest will be the long period effects, i.e., 
those fluctuations in the orbital elements having periods of 
several months or longer. Short period variations, all of which 
have much smaller amplitudes, will be averaged out. For a discussion 

of these latter effects the reader is referred to papers by 
2 3 

Giacaglia and Osterwinter . The lunar gravity coefficients, J 2Q , 

J .J will be retained in the equations of motion. Cassini's 
22 3’ 4 

laws on the figure of the Moon will be considered in their classical 
form, i.e., the smaller effects of the physical librations will 
be ignored. 
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The relative effect of the eccentricity of the terrestrial path 

on the short period variations is of order e (=0.055). However 

“ 2 2 

its effect on the long period behavior is of order e £ (=0.055 ) 
and thus will be neglected. Such is also the case with terms in- 
volving e sin I . The small effects of the solar radiation pres- 
E E 

sure along with the direct solar gravity effect will not be 
considered . 


I I . CHOI CE OF REFERENCE FRAME 


As the behavior of close lunar orbiters is of primary 
interest, a reference frame coincident with the lunar equator is 
most convenient. This is especially so if the higher harmonics 
of the Moon's gravity field are considered. However, as a con- 
sequence of Cassini's laws, the plane of the lunar equator is not 
fixed in space. Cassini's laws state that the plane of the lunar 
equator, the ecliptic, and the plane of the Moon's orbit all 
coincide in a common line (ignoring physical librations). This 
line is the node of the lunar orbit as referenced to the ecliptic. 
It is convenient to choose this line (the ascending node) as the 
x-axis of the reference frame. The lunar axis of rotation is the 
z-axis. Thus full advantage is taken of the geometry of the 
system. This is described in Fig. (1). The system rotates in 
retrograde manner with a period of about 18.5 years. 

If one is to work in the rotating system just described, 
the equations of motion, derived for a satellite moving in an 
inertial frame, must be modified. It is suggested that this 
modification take the form of an an additional perturbing term in 
the Hamiltonian. The system of reference is rotating with 
angular velocity components 

to = 0.0 rad/sec 

X -10 

u = 3.0X10 rad/sec (1) 

y -8 

u> = -1.07x10 rad/sec 

z 
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In an inertial frame of reference the Lagrangian is 


£ = r T /2 - V(r T ) 


In the rotating frame 


£ = (r r + Uxr r ) /2 - V(r r > 

^ d£ - - - 

P r - dF = r r + “ Xr r 


The Hamiltonian is 


K = P 

r r 


Expressing H in terms of (p , r ) one obtains 

r r 


K * g V P r + V(r r } - “* h 


( 2 ) 


(3) 

(4) 


(5) 


( 6 ) 


where h is the angular momentum of the satellite whose components 
are 


h = G sin i sin h 
x 

h = -G sin i cos h (7) 

y 

h = G cos i = H 
z 

where G,H,h are the usual Delaunay variables. The additional 
term to be added to the Hamiltonian is 


«r 

«r 


-to-h = -w h - w h 
y y z z 

+co G sin i cos h - cj H 
y z 


( 8 ) 
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III. THE DISTURBING FUNCTION 


The computation of the disturbing function due to the 

perturbing effects of the Earth and Moon is straightf oward but 

lengthy and thus will not be reproduced here. Employing a 
7 

result due to Brouwer and Kozai , the Hamiltonian in mixed 
Keplerian and Delaunay variables is 


- j{ = F = +2^2 - n E © - UyG sin i cos h + w z H 


4 2 

+ | jj 20 (l-3cos 2 i) + 6J 22 sin 2 i cos 2(h-0 E ) 

L G 


3 ^Rm 3 ! 

8 L 3 G 5 r 

o 6 4 f 

3_ j Jr 

128 37 


J^e sin i (l-5cos i)sin g 


2 4 2 

[(3-30cos i +35cos i)(2+3e ) 

2 2 4 

■10 e cos 2g(l-8cos i +7cos i) 


n 2 2 
16 


2 12 2 
(2+3e )<3cos i-1 +3sin i cos 2 

2 f i 2 2 

+ 15e \-(l+cos i) cos 2(g+h-0 E ) + sin i cos 2g 


(h-0 E )| 


1 2 1 
+ -(1-cos i) cos 2(g-h+0 E )> 


+ sin I F (i 


E ^(6+9e )sin 2i cos h -cos(h-20 E ) 
2 

-30e sin i cos i cos h cos 2g 


-sin h sin 2g +sin(h-2g)sin 2g 

-cos i cos (h-2&£> cos 2g| 

2 f 2 2 2 2 2 

+ sin I £ \(6+9e )(sin i -sin h -cos h cos i) 

2 2 2 2 
(sin h +cos h cos i +sin i)cos 2 t? E 
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+30e^cos i sin h cos h cos 2 0 £ 
-15e 2 isin2h + sin 2 i - cos 2 h cos 2 i 


cos i sin h cos 


h |si 


sin 2 g 


+(cos 2 h cos 2 i - sin 2 h - sin 2 i)cos 20 


5 |cos 2 g | 


(9) 


The short period terms containing £, the mean anomaly, have been 
averaged out. 0 is canonically conjugate to 0 E , the Earth coordinate, 
and F is the negative of the Hamiltonian. This convention will be 
used throughout . 

It is desireable to write the Hamiltonian in the form 

f = f 0 +f^+f 2 + ... 

where Fq is of order unity. F^ is of order 10 ~ 2 , Fg is of order 10 "^ 
and so on. To determine the order of each term in Eq.(9), Fig. (2) 
is found useful. The terms |j. 2 /2L 2 and n^ are seen to belong to Fq 
and Fi respectively. Terms assigned to F 2 are co z H and the contri- 
bution associated with J 2 q and J 22 - The Earth perturbation and 
those effects due to J 3 and J 4 belong to F^ . The disturbing function 
is thus of the form 

F(L,G,H,0 ,g,h, 0 E ) = F 0 (L) + F 1 (0) + F 2 (L,G,H, g , h , 0 E ) 

+ F^OjjG^, g,h, 0 g) + ... 

At this point one wishes to eliminate all terms in F 
containing 0 E . This is accomplished by means of a stationary 
generating function 

S(L* ,G* ,H' ,0 * ,g,h, 0 E ) = L ' i + G'g + H*h + 0*0 E (10) 

+S 1 (L , ,G’,H , ,g,h,0 E ) + S 2 (L* ,G’ ,H f ,g,h, 0 £ ) + ... 

such that the new Hamiltonian F*(L' ,G* ,H' ,0* ,g* jh 1 ) does not contain 
0 £ , The new coordinates are related to the old by the relations 
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i — _ it 

L " 51 “ L 




with similar relations for the other variables. From the 
relation 

F*(L\G’,H\e\B\h\9p = F(L,G,H,0,i,g,h,e E ) 


the following equations are derived for and £> 2 . 

F 2 a\G\H\g,h,-) = Fga’.G'.H'.g.h.ej.) - 
* ^F2c>Si dF2^Si _ ^FodSi dF2c)Si ^82 

f 3 + 5J5 g* + 5E5h*“ f 3 + + Zvdh " n E§e; 

choosing and to cancel the periodic parts of Eqs. (12) 

oGg deg 

and (13) respectively, the new Hamiltonian is 


( 12 ) 

(13) 




2 l T 2 - “E 0 ' + F|(L\G\H\g',h') + 

+ 1F1F + F 3 a’,G',H’,g’,h’) 


(14) 


As F* does not contain e£, 0* is constant and will be dropped 
in the following discussion. 

In order to use the von Zeipel method to determine 
the long period behavior of the orbital elements, F must be of 
the form 

2 

F* = ^72 + Fja-.O-.H-) + Fjtt'.G’.H'.g’.h') 


(15) 


where F* is dominated by F^. This can be done if the orbits 
under consideration are restricted. The J.ower orbits, where 
JgQ is the dominant perturbation, automatically fall into 
this category. Also included are the higher orbits provided 
the inclination remains low. 
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in Eq. (15) is in mixed variables 


4 = 7^ '20U -3cos 2 i) —r — (2+3e 2 ) (l-3cos 2 i) 

L. G* 3 

3 Lt 6 Rm 4 


w H* - — - K m J 4 [3-30cos i+aScos^i] (2+3e ) 
128 L’ 3 G’ 7 


j »2 


16 


sin 2 I £ [ (2+3e 2 )|(3cos 2 i-l)] 


The coupling terms in Eq. (14) have been dropped because of their 

relatively small size. All of the terms independent of g and 

* * * 
h in F 3 are included in Fg. F 3 is 


5 3 

F~ = -j- ~ _ J~e*sin i (l-5cos 2 i) sin g' 

3 8 l’ 3 g' 5 


6 4 

30 [i Rm 


J^[ l-8cos 2 i+7cos^i ]e 2 cos 2g* 

» ‘'pi * 


128 


L* G 


+ n £ 2 a ,2 e ,2 sin 2 i ' cos 2g'- tocsin i* cos h* 


+ 16 


sin I £ |(6+9e ,2 )sin 2i'cos h* 
-30e ,2 sin i' [cos i* cos h' cos 2g’ 
sin h* sin 2g f 




sin 2 I E j( 


sin^l E { (3+^e ,2 )sin 2 i ' cos 2h' 


2 

15e* cos i f sin 2h* sin 2g' 


+ ™e ,2 (l+cos 2 i * )cos 2h* cos 2g* 


^e ,2 sin 2 i 'cos 2g 


•}] 


(16) 


(17) 
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It should be pointed out that for the lower orbits the 
various angles appearing in Eq. (17) i.e., g', h ’ , h'-g', h*-2g\ 
are all driven by the dominant F* term. One must beware however 
of the various instances where any of the angular rates become 
small. This occurs near the eleven (slightly altitude dependent) 
critical inclinations® at i* = 46.6, 56.1, 63.4, 69.0, 73.2, 90.0 
106.8, 111.0, 116.6, 123.9, and 133.6 degrees. When the inclina- 
tion is near one of these critical values, the resulting behavior 
of the coordinates can exhibit very long period variations and 
the von Zeipel method, now to be followed, fails. A method 
valid in these special situations will be outlined in a later 
section. For a high orbiter having a moderate inclination 

(above 40 deg.) F* in Eq. (16) will contain some g* and h : 

2 * 

dependent terms that are now included in Fg. In this case the 

von Zeipel procedure fails. This situation is discussed by 

1 9 

Kozai and Vagners • 


IV. THE LONG PERIOD TERMS 


As before one looks for a transformation from variables 
(L , ,G , ,H , ,g , ,h») to new variables (L m ,G" ,H ,f , g" ,h” ) such that the 
new Hamiltonian F** is a function of (L tf ,G*' ,H** ) only. Consider 
the stationary generating function 

S* = L"X’ + G’v + H"h* + S*(L",G”,H",g",h"> + ••• 


The relation between the coordinates are 


G' 



g" 


g’ 



with similar relations between the other variables. 


(18) 


(19) 
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Choose S 


where 


to be of the form 


S* = a i sin 2g ' + a 2 COS g * + a 3 sin h ' + a 4 sin 2h' 
+ a^sin(h* +2g' ) + a 6 sin(h , -2g* ) 

+ a 6 sin(2h , +2g* ) + a^sinteh'^g’ ) 


( 21 ) 


^ H 6 R ^ j (l-8cos 2 i"+7cos 4 i"\e" 2 
128 l „ 3 g „7 4 \ J 


15 2 ,,2 „2 . 2.,,, 3 2 1 /„oF ? 

— n E a e sin i (1 - jsin Igl / 2^ 


= r _3 ^ 

L 8 l” 3 g" 

= £ U) Z G" 

r 2 „ 

n F a 

‘ If “ 

■[-« 


5 J 3 e, sin i" (l-5cos i") 


/sfo 

5 # 


2 2 

sin i" — ^ sin l£(6+9e n )sin 2i fl 


16 


sin 2 l (6+9e" 2 ) sin 2 i ft 
b 


'c*Fj 


oFq 




2 ,,2 ,,2 . . it /. . it v 

n a e sm ± E sin l (1+cos i ) 


rS + S 


E 2 a" 2 e" 2 sin I £ sin i" (1-cos i") 


«7 = 


[- H«, 

[- S ”, 2 -” 2 *" 
[* S "j 2 -" 2 -" 


f & - * 
"> a ]/l# * ‘H*] 

■> 2 ]/fe - °1*] 


2 2 

sin I E (l4cos i 


2 sin 2 I E (l-cos i' 
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The Hamiltonian F** is now independent of g" and h n and is 


4 2 


F** (L n ,G n ,H" ) = — + x — J 20 ( 1 - 3cos2in ) + w H" 

2 4 ^. 3 ^ 


2L 


4 


_3 ^ Rfl _ J (2+3e" 2 ) 3-30cos 2 i"+35cos 4 i 


128 l ., 3 g „7 4 


II 


2 2 

+ £ 16 2 '" ( 2+3e " 2 >( 3 cos 2 i"-l)a- | sin2l E > (22 > 


As and are known, Eqs. (11) and (19) are utilized 
to determine the behavior of the elememts (L,G,H,i,g,h) . The 
coefficients of the trigonometric terms in Eq. (21) contain six 
critical divisors. These correspond to the eleven (slightly 
altitude dependent) critical inclinations mentioned previously. 

It appears from an inspection of Eq. (21) that near a critical i" 
the amplitude of the coordinate variations can become nearly 
infinite. Actually this is not the case as will be shown in the 
following example. 

Suppose the inclination is near 90 degrees, A near polar 
orbit is chosen as the very long period behavior resulting from the 
laws of Cassini is best demonstrated. The slowly varying Hamiltonian 
is (i.e., the relatively fast variable, g' , has been averaged out) 


2 4 2 

p** _ fcL + I ^ o J on (l-3cos 2 i" ) + wH" - (JySin i"cos h” 

2L m ^ ** L 1 * g h ^ z 


_2 „2 


16 


e” 2 ) ( 


+ nE ^ (2+3e" 2 ) I (3cos 2 i M -l) + 3sin Ir-sin 2i"cos h n 


+ ^ sin 2 I F sin 2 i” cos 2h H 
2 E 


(23) 
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where all of the elements are slowly varying. The secular terms 
involving J 4 and sin 2 I E have been omitted in Eq. (23) due to their 
small size. Since the Hamiltonian does not depend on S' or g n , L n 
and <j* are constant and the equations of motion have been reduced 
to that of one degree of freedom. They are expressed as 


h" = - 


§ 3 *' 


3 T /Rm\ 3/n E \ (2+3e 2 ) 

2 J 20\ p ** / - sin ) 


H" 

n g" " “z 


( 24 ) 


-G"sin = H" = + gjj, = UyG"sin i"sin h" 


' if n E 2 a" 2 (2 + 3e" 2 ) 


2 2 

sin I E sin 2i"sin h H + sin l v sin i H sin 2h' 


(25) 


-8 -4 

Note that w z = - 1.07XK) and J 2Q = - 2.0X10 


The phase plane 

(H** ,h" ) contours of constant F** in Eq. (23) are shown in Fig. (3). For 
very low orbits (Fig. 3a) the stable equilibrium points occur at 


0 degrees and i" = cos 


88 degrees. Recall that the 


nodal position, h , is measured from the point where the plane of 
the lunar equator, the ecliptic, and the terrestrial path meet in 
a common line. It appears that the orbital plane of a low orbiter 
can become trapped in this same configuration. Or it can exhibit 
very slow stable oscillations about this position, the period of 
which is about twenty years for a low satellite. 

The interplay between the inclination and nodal position 
demonstrates remarkably different behavior for higher orbiters. 

For a semi-major axis of 1.5 and an eccentricity of 0.0, the 
behavior of (H u ,h n ) or equivalently (i M , f/ 1 ) is shown in Fig. (3b). 
In this case the inclination can be trapped near 83 degrees but 
h" appears to be stable between 0 and 90 degrees. The behavior 
for a still higher orbiter is shown in Fig. (3c). 
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For this case the stable equilibrium solutions occur at slightly 
over 90 degrees (and slightly less than 270 degrees) and i" = 74 
degrees. This is near the 73.4 degree critical inclination (when 
h n - g" = 0) mentioned previously. However in this particular case 
a closer examinination shows that this critical i" occurs at about 
66 degrees. This particular orbit may eventually impact the sur- 
face. (cf. Kozai) 


V. CONCLUSIONS 


The long period behavior of a lunar orbiter is 
determined for a certain class of orbits. The method of succes- 
sive approximations is employed in treating the circulating 
orbits. In this case the angles g” , h", g"+h" , etc., were 
assumed to move at nearly uniform rates. The librating orbits 
(when one of the angles does not move through an angle of 360 
degrees) are treated by the use of a phase plane analysis. 
Treated, as an example of the latter, are near polar orbits in 
which the indirect effect of the Sun (described by the laws of 
Cassini) is important. Stable altitude dependent positions of 
the orbital plane are found. 


VI . ACKNOWLEDGEMENTS 


The author wishes to thank his advisor, Professor 
John V. Breakwell, for his helpful suggestions and guidance. 

This research was supported by the National Aeronautics 
and Space Administration under Contract No. NsG 133-61. 


247 



CLOSE LUNAR ORBITER LONG PERIOD BEHAVIOR 


REFERENCES 


1. Kozai, Y. , "Motion of a Lunar Orbiter," Journal of the Astro- 
nomical Society of Japan, 15, 1963. 

2. Giacaglia, G.E.O., "The Motion of a Satellite of the Moon", 
Goddard Space Flight Center Report X- 547- 65-218, Greenbelt, 
Maryland, June 1965. 

3. Osterwinter, C. , "The Motion of a Lunar Satellite," Ph.D. 
Dissertation, Yale University 1965. 

4. Melbourne, W.G. , et al, "Constants and Related Information 
for Astrodynamic Calculations," J.P.L. Technical Report 32- 
1306, July 1968. 

5. Akim, E.L., "Determination of the Gravity Field of the Moon 
by the Motion of the AMS LUNA 10," N.A.S.A. No. ST-CM-LPS- 
10532, 

6. Breakwell, J.V. and D. Hensley., "An investigation of High 
Eccentricity Orbits about Mars," First Compilation of Papers 
on Trajectory Analysis and Guidance Theory, N.A.S.A. SP- 
141, 1967. 

7. Brouwer, D. , "Solution of the Problem of Artifical Satellites 
Without Drag," Astronomical Journal, 64, 1959. 

8. Garfinkel, B., "On the Motion of a Satellite in the Vicinity 
of the Critical Inclination," Astronomical Journal, 65, 1960. 

9. Vagners, J. , "Some Resonant and Non-Resonant Perturbations 
of Earth and Lunar Orbiters," Ph.D. Dissertation, Stanford 
University, August 1967. 


248 


NORMAL TO ECLIPTIC 



249 




sin i cos h 



THE RELATIVE IMPORTANCE OF THE VARIOUS EARTH MOON PERTURBATIONS 







LECTURES ON NONLINEAR RESONANCE 


By W. T. Kyner 

University of Southern California 


253 



LECTURES ON NONLINEAR RESONANCE 
W. T. KYNER 

In the early 1940' s, interest in the theory of nonlinear 
differential equations developed rapidly in the United States. 

Friedrichs, Hurewitz, Levinson, Stoker at Brown, Lefschetz, Bellman 
at Princeton, and Minorsky at the David Taylor Model Basin were among 
those most responsible. In particular, Lefschetz recognized the 
importance of the Soviet contributions during the preceding decade 
and helped make much of this work accessible to the American technical 
public. In 1942, he prepared a translation of excerpts from mono- 
graphs of Krylov and Bogoljubov [5 ] whose averaging techniques are 
closely related to the general perturbation theories of celestial 
mechanics. It is interesting to note, however, that just when Krylov 
and Bogoljubov were starting their research in nonlinear mechanics , 
an elderly American, E. W. Brown, Gibbs professor of mathematics at 
Yale University, explained and essentially justified the important 
concept of resonance as a basically nonlinear phenomenon. His lectures, 
"Elements of the Theory of Resonance Illustrated by the Motion of a 
Pendulum," were given at the Rice Institute in April 1931 and were 
later published as a Rice Institute pamphlet [2 ]. They are particu- 
larly relevant to this year's Yale Summer Institute because of the 
importance of resonance phenomena in geodetic satellite theory. 

In my lectures on resonance I shall follow Brown's exposition 
of the basic concepts, but I shall use the Krylov-Bogol jubov method of 
averaging in the mathematical analysis. The main application of this 
theory will be to satellite problems. 
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1 . Pendul urn problems 

As we all know, a stretched wire has certain modes of vibration 
which seem independent of the strength of the energy source. But we 
tend to forget that the "natural frequencies" of these modes are a 
mathematical fiction since they are only present "when the vibrations 
have infinitely small amplitudes, which amounts to saying that the wire 
is not vibrating at all. More properly, a natural frequency should be 
defined as the lower limit of the frequency of that particular mode of 
vibration. It is necessary to insist on this change of frequency with 
change of amplitude because the existence of the phenomena of resonance 
depends on the existence of this change" (p. 2 of [5]). Furthermore, 
a detailed analysis of the "locking in" effect which is observed when 
two piano wires are tuned to the same frequency depends in an essential 
way on the change of frequency with amplitude. This is discussed in 
detail by Brown and more concisely by Cesari (p. 151 of [3 ]). I shall 
omit such a discussion here and go directly to the pendulum problems 
which are physically less interesting, but more relevant to satellite 
theory. 

We first consider an ideal pendulum of length b with an 
oscillating support (see figure 1). 

Let Y be the horizontal distance of the support point S 
from a fixed point 0 and x the angle which the pendulum makes with 
the vertical. The support point S is constrained to move in the 
horizontal direction. The equation of motion of the pendulum is 

(1.1) cos x + b - - g sin x, 

dt 2 dt 2 
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(we sum the forces along the line perpendicular to the pendulum). 

We now assume that S oscillates with a motion given by 
Y(t) where 

= -eg f(at), e small, f(z + 2-rr) * f(z), all z. 
dt 2 

Then 

(1.2) + u 2 sin x * e f(i|0 cos x, to 2 = g/b, ^ = at. 
dt 2 

The second model problem is that of two pendulums, each of 
mass m, but with different lengths, attached to a bar of mass M 
which is constrained to move in the horizontal direction (see figure 2). 
We assume that the total horizontal momentum is zero, i.e., 

(1.3) -gj* [MY + m(Y + b sin x) + m(Y + a sin y)] = 0. 

The equations of motion are 

d 2 Y d 2 x 

— cos x + b — = - g sin x, 

dt 2 dt 2 

(1.4) 

d 2 Y ^ d 2 y 

cos y + a = - g sin y. 

dt 2 dt 2 

Using (1.3) to eliminate d 2 Y/dt 2 , we obtain 
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A2 x 2 . 

+ (i) 2 sin x = e cos x (sin x + r- sin y) , 

dt 2 dt 2 b 


(1.5) 


4-^ + a 2 sin y = e cos y - — (sin y + 4 sin x) , 
dt 2 dt 2 a 


where w 2 = g/b, a 2 = g/a, e = m(M + 2m)" 1 . We assume that e is 
small. Equations (1.5) are awkward to work with since the second 
derivatives of x and y appear in both equations. We therefore 
rewrite the equations as 


4-2. + u) 2 sin x = - e cos x [1 - e (cos 2 x + cos 2 y)] _1 
dt 2 

dx 2 

{a) 2 (cos x sin x + cos y sin y) + sin x 


( 1 . 6 ) 


+ f sin 


y 



4-£+ a 2 sin y = - e cos y [1 - e (cos 2 x + cos 2 y)] _1 
dt 2 

t) dx 2 

{a 2 (cos x sin x + cos y sin y) + - sin x (^-) 

+ sin y ($)*> . 

Each equation of (1.6) can be interpreted as a perturbed 
pendulum equation. We therefore can use the same mathematical pro- 
cedures on equations (1.2) and (1.6). The first, and rather difficult, 
step is to introduce new coordinates so that the differential equations 
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will be in the normal form for the method of averaging. In order to 
motivate the coordinate change and to display the simplest features of 
resonance, we shall now study the linear differential equation obtained 
from (1.2) by making the small angle approximation, i.e., sin x = x, 
cos x = 1 . We have 

(1.7) + (o 2 x * e f(ijj), ^ = at. 
dt 2 

The solution to (1.7) can be written 
- t 

(1.8) x(t) = r Q cos 6(t) + ^ / sin w(t - u) f(au) du, 

o 

where fl(t) 5 u t + 0 , r Q and 0 q are constants determined by 
initial conditions. 

If a, the frequency of the forcing function, is an integral 
multiple of w, the frequency of the linearized pendulum equation, 
then unbounded solutions of (1.7) are possible. In other words, if 

(1.9) ak = oj, k an integer, 

then the condition of linear resonance has been satisfied. It is 
obviously the same for all forcing functions of period 27r/a, but 
the existence of unbounded solutions depends on the presence of 
sin wt or cos ait in the Fourier series expansion of a particular 
forcing function. The concept of linear resonance is of limited 
physical significance since the small angle approximation is destroyed. 
Before leaving the linear approximation, let us consider the 
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homogeneous (e = 0) problem with the aid of the corresponding phase 
and potential planes (see figure 3). In the phase plane we plot the 
level curves of the energy integral, 

E( x ,x) = x 2 + ^ co 2 x 2 = h , a constant, 

and in the potential plane, the two curves 

c = h , c « j a) 2 x 2 . 


Each level curve is characterized by its energy and therefore by its 
amplitude. We now introduce r, the amplitude (note that h = oj 2 r 2 /2), 
and e, a normalized angle, as dependent variables, i.e., we set 


(1.10) x = r cos 6, x = - 03 r sin e, 0 = oj t + 0 Q . 
The inhomogeneous equation (1.7) is equivalent to 

dt = w sin e * 


( 1 . 11 ) 


de _ 
dt 


<D + f(ljf) COS e, 


d^ _ 
dt 


a. 


Equations (1.11) are in the normal form for the method of averaging. 
Note that if e is nonzero, they are nonlinear. 

In order to reduce the nonlinear pendulum equation (1.2) to 
normal form, we seek a coordinate transformation (compare with (1.10)) 


261 




LECTURES ON NONLINEAR RESONANCE 

(1.12) x = F(r,6), x = G(r,0) , 

with F and G having period 2 tt in e, such that the unperturbed 
(e = 0) equations have the form 

dr = o 

dt u * 

(1.13) Jf ■ z(r), 

f ‘ «■ 

It will be shown later that z(r) ■ id 2 (1 - r 2 / 1 6 + ...)* 

The construction of the transformation (1.12) is somewhat 
complicated, but it can be motivated with the aid of the phase and 
potential planes. We again plot (see figure 4) the level curves of the 
energy integral, 

E(x,x) - \ x 2 + u) 2 (l - cos x) = h, a constant, 

in the phase plane, and the curves 

c = h, c = co 2 (l - cos x) , 
in the potential plane. 

We see that h less than 2w 2 implies that the motion is 
periodic and that the level curves are characterized by the amplitude. 

The transformation (1.12) is therefore possible. Since we need to 
study several nonlinear differential equations in the satellite problems, 
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we shall give a general construction which will then be applied to 
the pendulum problems. It is similar to the one used by Brown and 
more recently by Morgunov [7]. 

Consi der 

(1.14) — +q(x)=0, q(-x) = -q(x), q'(0)>0. 

dt 2 

If r is positive and not too large, then 

1 x 

(1.15) i* x 2 + Q(x) = Q(r) , where Q(x) = / q(x')dx l , 

o 

is the equation of a closed integral curve in the phase plane. It is 
generated by a periodic solution of (1.14). 

From 

(1.16) x = (2[Q(r) - Q( x) ] ) 1/2 = fl(r,x), 
we have 

x -i 

(1.17) 6 = z(r) / A (r,x‘) dx' = B(r,x), 

where the frequency z(r) is given by 
r 

(1.18) 2tt z _1 (r) = 4 / A (r,x') dx‘ . 

o 

Equations (1.16-1.18) implicitly define the required transformation 
(1.12). Clearly, 
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(1.19) H = z(r). 


dr 

dt 


= 0 . 


(1.20) 


will be 


0 . 21 ) 


The perturbed equation, 

— +q(x) = e flat), 

dt 2 

transformed into a system of first order equations, 
= z{r) + e 0(r,0,i))) , 

^ = e R(r,e,i|i), 


dib 

dt 


a. 


To do this we write 


dx _ 5A dr . 3A dx 
dt 3r dt 3x dt * 


( 1 . 22 ) 

de _ 3B dr 3B dx 

dt ~ 3r dt 3x dt * 


But 


dx 

dt 


A(r,x) , 


— = - q(x) + e f(ot). 
dt 2 


and 
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fp - q(r) A _1 (r,x) , M = — q ( x ) A'W), 
ff = z(r) A _1 (r,x). 

Hence, 

•gl = z(r) + e f(iji) A(r,x) z"’(r) |£<r,x), 

(1.23) || = e f(*) A(r,x) q' l (r), 
dUj 

dt = 

where e = B(r,x) determines x as a function of 6 and r. 

Equation (1.23) can be simplified if we have an explicit 
formula for x = F(r,0), the inverse to 6 = B(r,x). For from 

k = A(r,x) = z(r) §f(r,e). 

A(r,x) |p{r,x) = - A(r,x) ||(r,x) = - z(r) |£<r,e), 

we obtain 

= e f(*) z(r) q _1 (r) jjr.e). 

(1.24) 5 | = z(r) [1 - e f(*) q _I (r) g(r,0)]. 

= n 
dt a - 
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Note that F(r,e) and z(r) can frequently be represented by an 
infinite series, e.g., by using Lindstedt's method (p. 116 of Cesari 
[3]). 

We have finally transformed our differential equations into 
the normal form for the method of averaging, our next topic. 

2. The method of averaging. 

For convenience, new notation is employed in this section. 

We consider a system of ordinary differential equations, 

= £ X(x,y), x = ( X J x^) , 

( 2 . 1 ) 

= z(x) + e y(x,y), y = (y x .. . . ,y N ) . 

with initial conditions x(0) = a, y(0) = b. The vector valued 
functions, X(x,y), Y(x,y), are assumed to be smooth and to have 
period 2tt in each y n . The x m are called slow variables, the y n 
fast variables, since if e = 0, 

x m = V 1 £ m £ M, 

( 2 . 2 ) 


y 


n 


z n (a) t + b n , 1 < n < N. 
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Our goal is to construct a transformation, 
x * u + e P(u,v), 

(2.3) 

y = v + e Q(u,v) , 

with P and Q having period 2 t t in each v n# so that the equations 
(2.1) become 

= e U(u) + e 2 W^UjV.e), 

(2.4) 

= z(u) + e V(u) + e 2 W 2 (u,v,e). 

In other words, the fast variables have been eliminated (to first 
order) from the differential equations. As we shall now show, this 
elimination is an averaging procedure; in fact, we can take 

2tt 2tt 

U(u) = (2 tt)' N / ... / X(u,y) dy ...dy N , 

0 o 1 

(2.5) 

V(u) = (2 u)' N / ... / Y(u,y) dy ...dy N . 

0 0 1 

Approximate solutions to (2.1) can be constructed by solving the 
first order averaged equations, 
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^ = e U(u), u(O) = a', 

(2.6) ^ = z(u) + e V(u) , v(O) = b', 

a = a' + e P(a' ,b'), b = b' + e Q(a‘ ,b'), 

and substituting the solution into (2.3). 

If we differentiate (2.3), then from (2.1) and (2.4), we have 

e X(u + e P, v + e Q) = (I|yj + £ Hj)(e U + e 2 W j ) 

+ e §£ (z(u) + e V + e 2 W 2 ), 

z(u + e P) + e Y(u + e P, v + e Q) = (I N + e |^)(z(u) 

+ £ V + e 2 W ) 

2 

+ E (e U + e 2 W ). 
au i 

Expanding in powers of e, we have 

e X(u,v) = e[U(u,v) + !£■ (u,v) z(u)] + e 2 [**], 

z(u) + e[|jj (u) P(u,v) + Y(u,v)] » z(u) + e[V(u) 

+ |J (u,v) z(u)] + e 2 [**] , 

where [**] denotes a smooth function of u, v, £ whose explicit 
formula is not needed here. Clearly, we must require that 
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X(u,v) - U(u) = (u,v) z(u), 

(2.7) 

(u) P(u,v) + Y(u,v) - V(u) = f^(u,v) z(u) . 

If the first equation is to have a periodic solution, the 
left side must have zero mean. From this requirement, we have the 
first equation of (2.5). If, in addition, we know that P(u,v) has 
zero mean, then in order to solve for Q(u,v), we must have the 
second equation of (2.5). 

Let us briefly consider vector equations of the type 

(2.8) F(u,v) = ff<u,v) z(u), 

where the given function F(u,v) has period 2tt in each v n * Since 
we seek a periodic solution, we expand both S and F in a Fourier 
series, 

s = Sj(u) exp 1 [j , v] , j_= ( j 1 J N ). 

(2.9) 

F = Ij Fj(u) exp i [j , v] , 

where 

“ l" Vn • 

Substituting (2.9) into (2.8) and equating the coefficients of 
exp i[j,v], we obtain an infinite set of equations. 


271 


LECTURES ON NONLINEAR RESONANCE 


F 0 (u) = 0, 

( 2 . 10 ) 

Fj( u) = iU.z(u)] Sj(u) . 

If for all u in the domain of interest, and for all integer 

vectors j , we have the nonresonance condition , 

(2.11) [j.z(u)] f 0, 

then we can solve for the S ^ ( u ) , and therefore for S(u,v). We 
obtain a formal series , 

(2.12) S(u,v) = - iU.z(u)]' 1 Fj(u) exp i[j,v] . 

The denominators U»z(u)] can become small as j becomes large, 
thereby preventing the convergence of the series (2.12). We avoid 
this difficult problem (the classical small divisors problem) by 
assuming that F(u,v), and therefore, S(u,v) are trigonometric 
polynomials. Note that to the particular solution of zero mean (2.12) 
we can add an arbitrary solution of the homogeneous equation, 

(2.13) 0 = |5{u,v) z(u) . 

Returning to (2.4), we see that if the nonresonance condition 
(2.11) is satisfied, and if U and V are chosen by (2.5), then the 
transformation (2.3) can be constructed. 

It is not always convenient to require that P(u,v) and 
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Q(u,v) have zero mean. For example, if the system (2.1) is 
Hamiltonian, then the solution of the homogeneous equation (2.13) 
can be selected so that the averaged equations (2.6) are Hamiltonian. 
John Morrison [ 8 ] has developed a generalized method of 

averaging in which the additive arbitrary functions play an essential 
role. Much of this section is based on his work. 

Before discussing the resonance problems (our main topic), we 
shall investigate in what sense the solutions to (2.6) determine 
approximate solution to the original equations (2.1). For simplicity, 
we shall consider scalar equations, i.e., N = M = 1. 

Let u(t) and v(t) be solutions of the exact equations 
(2.4), and u*(t), v*(t) be solutions of the approximate equations 

(2.6) with 

u(0) = u*(0) = a', v(0) = v*(0) = b'. 

Let 

(2.14) r = u - u*, s = v - v*, 

x* = u* + e P( u* , v*) , y* = v* + e Q(u*,v*). 

Then 

= e [U(u* + r) - U(u*)] + e 2 !^, 

(2.15) 

^j| = z(u* + r) - z(u*) + e [V(u* + r) - V(u*)] + e 2 !^. 
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We assume that in the domain of interest the given functions and 
their derivatives can be bounded by the same constant C. Then from 

(2.15) , we obtain the inequality 

t 

I r(t) | < e / C|r(t')| dt‘ + e 2 Ct , 
o 

(2.16) 

t 

|s(t)| < (1 + e) / C| r(t ' ) | df + e 2 Ct . 
o 

By the generalized Gronwall inequality (p. 11 of Sanone and 
Conti [ 9 ]) , we have 

| r( t) | < e [exp (e Ct) - 1] < e 2 Ct exp ( e Ct), 

(2.17) 

| s(t) | < (1 + e) [exp (e Ct) - 1] + e 2 Ct - e C(1 + e) t 2 /2 
< e Ct [2 exp (e Ct) + e - t] . 

Therefore from (2.3) and (2.14), we have the error estimates, 

| x ( t ) - x*(t)| < (1 + e C) | r(t) | + e C |s(t)| , 

(2.18) 

|y(t) - y*(t) I < (1 + e C) | s ( t) | + e C | r(t) | . 

On an interval 0 _< t T fixed, the error in the 

slow variable satisfies the inequality, 

(2.19) | x ( t ) - x*(t)| < e 2 t C*, 
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while the error in the fast variable satisfies the weaker inequality, 

(2.20) | y ( t ) - y*(t) | < e t C* , 

where the constant C* depends on the bounds C and T. 

In general, the estimates (2.19) and (2.20) are the best 
possible. Approximations which are meaningful on an infinite time 
interval can be constructed only under the most exceptional circum- 
stances. 

We now have developed the mathematical machinery for studying 
nonlinear resonance. 

DEFINITION. If there exists an x Q and k_ t 0 such that 

(2.21) [k,z(x Q )] = 0, 

then the condition for resonance motion has been satisfied. The 
degree of the resonance is the number of linearly independent integer 
vectors k which satisfy (2.21). 

A resonant problem can be reduced to a nonresonant problem by 
suitably reducing the nunber of fast variables. Let L denote the 
module of integer vectors k_ such that [k^,z(x Q )] = 0, and let 

ij . . . . ,1^, 1 < v < N 

be a basis of L, We can construct N - v linearly independent 
vectors 

k k 

— v+1 * * * * *— * 

perpendicular to L. Hence 
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(2.22) A = det IJCj k N | f 0. 

Set 

(2.23) q n = A' 1 [kfl.y] or q = Ky , 

where the N x N matrix K has as its rows the vectors k^/A . 

By construction, 

(2.24) [j_,Kz(x o )] = 0 

implies that the last N - v components of j are zero. 

The next step is similar to that used in boundary layer 
studies. We set 

(2.25) x = x Q + e 1/2 p . 

By virtue of (2.23) and (2.25), the original system of differential 
equations (2.1) is equivalent to 

% = c l/2 X(x o ♦ C 1 ' 2 p,K* 1 q) , 

(2.26) 

= Kz(x Q + e' /2 p) + e KY(x q + E 1/2 p,K _1 q) . 

Hence 

= e 1/2 X(x ,K _1 q) + e X^p.q.e^ 2 ) , 

at u 

(2.27) 

= Kz(x Q ) + e l/2 D(x 0 ) p + e Y^p.q.e) , 


276 


LECTURES ON NONLINEAR RESONANCE 


where D(x q ) = Kaz(x Q )/3x , etc. 

Since the components of K" 1 are integers, the system (2.27) 
has period 2 tt in each q n » Furthermore q l# ...*q as well as 
p«j,...p M are slow variables, while q v+ *j »• • • ,q^ are fast variables. 

Two features of the transformation of (2.1) to (2.27) should 
be emphasized. The equations (2.27) are valid in a neighborhood of 
x Q ; quoting from Brown (p. 8 of [ 2 ]) "resonance is not a single 
special case of motion but is a group of cases extending over a finite 
range of values of the constants." The second feature is a drop in 

l/2 

the order of the approximation; e rather than e is the 
perturbation parameter. 

We make one last change in notation. Let 

U = 

A = 

(2.28) 

= 

O) = 

Then 

dp - 

at 

(2.29) $ = 

d£ 
dt 


e * D < x o> = (*) • 

(9 j 9 V ) 1 X(x 0 . K ^ ) = R (X , >J> ), etc. 

(q v+ i>---.q N ) . 

([k v+1 .z(x o )],...,[k N ,z(x 0 )]) , 

U R(A,40 + y R J (p,A,<|),u) » 
y Ap + y 2 A (p,A,<J>,y) , 

uj + y $p + y 2 ^ (p ,A,4>,y ) . 
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Clearly, the first order (in y) averaged equations are (we 
forego another change of notation) 

2 7T 2 

= V R 0 (A), R 0 (x) = (2tt)' (N ' v) / ^R(x^) d^.d^ 

(2.30) ^ = y Ap, 


- w + y $P . 

Note that in both (2.29) and (2.30) A and $ are constant matrices. 
Furthermore, we have the nonresonance condition [j.to] = 0 implies 
j - 0. 

Let us consider the first two equations of (2.30). 



If Rq(A) vanishes at A * A q , then p = 0, A = A q is an equilibrium 
state of the system (2.31). The stability of the equilibrium state 
can be studied with the aid of the linearized equations, 


dt u 3X W ' V ’ 

(2.32) 

it (X - X 0 ) = y Ap • 
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The phenomenon of libration occurs if the equilibrium state is 
the center of a family of periodic solutions of (2.31). Just as with 
the linearized pendulum equations (1.7), the condition of resonance 
does not by itself insure that the motion has any special properties. 

For example, if the frequency of the unperturbed problem does not change 
with the amplitude, then A is zero, and libration is impossible. Hence 
Brown's claim, "the existence of the phenomena of resonance depends on 
the existence of this change." 

Before returning to the pendulum examples, one final observation 
should be made. The first order system (2.32) is equivalent to a second 
order system, 

(2-33) £i . M . 

dt 2 0 

If there exists a scalar function Z(x) such that 
AR 0 (X) = - grad Z(x) , 

then 

(2.34) 1 Ht X ] + v 2 Z(X) = const, 

is an integral of (2.33). 

3. Analysis of the pendulun problems 

As our first example, let us take the linearized pendulum 
equations (see (1.7) and (1.11)), 
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(3.1) 


cRf = ‘ sin 6 ’ 

if - “ + sf f <+> cos 6 ■ 


Note that we must have r nonzero. 

If 

(3.2) a) = ka, k an integer, 

the condition for linear resonance is satisfied. 

Following the procedure of the preceding section, we set 


x = (e - M/(i + k 2 ) , 

(3.3) 

4> = (ke + 1|<)/(1 + k 2 ) , 

r = r Q + up, r Q > 0. 

Then 

6 = X + k(J> , 

= - kx + <J> » 

and 
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P = - f (<f) - kX) sin (X + k(j>) , 

dt co 

dx _ u 2 f((j) - kx) cos (X + k<fc) 
dt u(r 0 + yp)(l + k 2 ) 

d£ _ + u 2 k f ( <j> - kX) cos (X + k<j>) 

dt oj(r Q + yp)(1 + k 2 ) 


The first order (in y) averaged equations are 
, 2ir 

^ Jr / fU - kX) sin (X + k<f>) d(J> , 
at O) ^tt ^ 

(3.5) = 0 , 


dt 


a . 


For simplicity, let us take f(i|0 = cos kip. From 
cos (k<j> - k 2 x) sin (x + k<j>) = j sin (1 + k 2 )X 

+ 1 sin [2k<J> + (1 - 

we have that the equations for the slow variables p and 
d| = - ^sin (1 + k 2 )X , 

(3.6) 


2 )X] , 
X are 
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Clearly, there are no periodic solutions of (3.6). It is important 
to note that the (unstable) equilibrium states are not of physical 
signi fi cance. 

The second example is the nonlinear pendulum equations (see 
(1.2) and (1.24)). In normal form, we have 

e f(i|i) z(r) || (r,e)[u) 2 sin r]' 1 , 
z(r) - e z(r) f(i|i) || (r,e)[(o 2 sin r]' 1 , 
oi , 

where 

z(r) = u(l - y|) + 0(r 4 ) , 

(3.8) 

x = F(r,0) = r cos 0 + 0(r 2 ) . 

It is essential that dz/dr f 0. 

We shall not derive (3.8) in detail, but only remark that if 

we set 

sin x/2 = sin r/2 sin 
then (see (1.18)) 

z(r) = a) — / (1 - sin 2 r/2 sin 2 t,Y 1 ^ dj; 

I ^ n 
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We can now easily derive the first equation of (3.8) by expanding the 
integrand in powers on r. 

Unfortunately, the transformation from x, x to r, e is 
singular at r = 0. For simplicity, we want r small, but if the 
equations (3.7) are to be meaningful, we must have r nonzero. We 
therefore assume that 

(3.9) 0 < e Y £ r 2 <_ e Y . 

The exponents of the bounds on r 2 will be chosen shortly. 

We now replace (3.7) by the simplified equations, 

& ■ - § f(K> sin 0 + 0(e r 2 ) , 

(3.10) ^ = “(1 - yg-) + ^ ?(♦) cos 0 + 0(e r 2 ) , 

$ “ «• 

The difference, and it is essential, between (3.1) and (3.10) is 
that the frequence of 0 depends on r. 

The condition for nonl inear resonance is that 

(3.11) uj(l - r Q 2 /16) = ka , k an integer, r Q f 0 . 

Assuming (3.11), we make the substitution (3.3) and obtain 
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J ^ f ( <(> - kA) sin (x + k<(>) + 0(pr 2 ) , 
at w 

.. ywr p p 2 ojp 2 P 2 f(4>-kA) sin(A+k<J>) 

(3.12) ^ = - + +0(y 2 r 2 ), 

dt 8(l+k 2 ) 16(l+k 2 ) u)(r 0 +yp)(l+k 2 ) 

d£ _ + k r_ m r p - p z + u 2 f(it-kx) sin (x+k<j>) 

dt (1+k 2 ) 8 0 16 w(r 0 +up) 

+ 0(y 2 r 2 ) . 

In the dA/dt equation we want the first term to be dominant 
even though r Q is small, i.e., we want yr Q » U 2 /r Q . Therefore, we 
require that 

(3.13) 0 < y 1_Y < r 2 < u 1_y/2 , 0 < y < 1/2 . 

The upper bound permits us to drop the 0(pr 2 ) and 0(y 2 r 2 ) terms 
in (3.12). Clearly, from (3.13), we obtain (3.9) 

We again set f(^) = cos kty. The first order averaged equa- 
tions are 

if = ■ it sin a + k2 ^ x • 

m»r 0 p 

8(l+k 2 ) 

pajkr 0 p 
8(l+k 2 ) 


(3.14) $ = 

# = 
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Let £ = (1 + k 2 )x + it, t = pit. Then the differential 
equations for the slow variables X and p are equivalent to a 
homogeneous pendulum equation, 

(3.15) ^- + a 2 sin c = 0, n 2 = r / 8 . 

dt 2 0 

Note that the independent variable is the "slow time," i = yt, 
and that the frequency Q_ depends on the amplitude r . 

The example of the pendulum with oscillating support can be 
discussed in more detail (see section III of Brown [ 2]), but we have 
displayed its most important properties. To summarize: because the 

frequency is amplitude dependent, libration can occur at resonance. 
The equations describing this libration are equivalent to the 
homogeneous pendulum equations, but are valid only over a finite time 
interval (of the order of e ' ). 

The two pendulum problem gives similar results. Here we set 

x = r 1 cos 0 X + CKr^), z 1 ( r 1 ) ~ ^(1 - r^/16), 

y = r cos 0 + 0(r 2 ) , z 9 ( r J = <*(1 - r 2 /16), 

^22 2. 2 Z 

(3.16) 

f = - u 2 (r cos 0 + r cos e ), 

1 1 1 2 2 

f =-a 2 (r cos 0 + r cos 0 ). 

2 v 1 1 2 2 


Then equations (1.6) are approximated by 
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d , v . 

-jr r = - ew (r cos e + r cos e ) sin e , 

Ut 1 1 12 2 1 

d , x . 

-nr r = - ea (r cos e + r cos 6 ) sin e » 

UL 2 1 2 2 2 2 

(3.17) 

4r 6 = a) (1 - r 2 /16) + (r cos 0 + r cos 6 ) cos 0 

Ul# 1 \ 1 2 2 1 

4r 0 = a (1 - r 2 /16) + (r cos e + r cos 9 ) cos 0 

ax 2 2 r 1 1 2 2 2 

2 

The condition for nonlinear resonance is 

(3.18) (1 - r i0 2 /16) = k 2 a (1 - ^ 20 2 /16) . 

We shall take k = k = 1. The analysis of other resonances is left 

1 2 

to the reader. 

Then with 


U - e 


_l/2 


, x = i (e - e J , 4 > ■ I (e + e ) , 


2 v i z * v 2 

(3.19) 

r = r + p 

1 10 K i 

we have (retaining the dominant terms) 


r 2 " r 20 + PP 2 * 


(3.20) 


If P, = - 1 fC r 10 sin 2(x+<(>) + r 20 sin 2<f, + r 2Q sin 2x] , 

ff P 2 = - t f L [r 2Q sin 2(x+ij>) + r iQ sin 2$ - r, Q sin 2x] , 

§• = ft t« r 20 p 2 - “ r io p i ] > 


d(t> 

dt 


a(l - r 2Q 2 /16) - ft ( u) r lo p i + ar 20 p 2 ) . 
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The averaged equations are 


d_ 

dt 


Pi 


= _ M r 
2 20 


sin 2\ , 


(3.21) 

$ “ 16 Ko P 2 - “ r io p . ] * 

it = “ (1 - r 20 2/16 ' ( “ r .0 P i + “ r 20 p « ) • 

Once again with c = 2X + tt, t = yt, we have the homogeneous 
pendulum equation describing the resonance phenomena, 

(3.22) + n 2 sin ? = 0, n 2 = (a 2 + u> 2 ) r r „/16 . 

dt 2 0 


The libration around ^ = 0 corresponds to an exact solution 

of the original equations (1-6) in the special case of equal lengths, 

i.e., a = u). For then, if x = -y, the two pendulums oscillate out 
of phase with exactly the same frequency. The support is motionless. 

If the lengths are almost equal, this "locking in" can be approximated 

if the initial displacements are chosen so that the resonance 
condition (3.18) is satisfied. The unstable equilibrium point, 5 = tt, 
corresponds to initial conditions x = y f 0. The support must then 
oscillate (preservation of linear momentum) --the subsequent motion 
of the pendulums is eratic. Finally, we note that the simplicity of 
equations (3.15) and (3.22) is due to the small amplitude assunption. 
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4. Motion of synchronous satellites. 

In this section, we shall study a typical satellite problem in 
order to establish the accuracy and the time interval of validity of 
the pendulum model which is derived by the method of averaging. 

The problem is the determination of the effects of asymmetries in 
the Earth's gravitational field on the motion of a synchronous 
satellite, i. e. , one whose mean motion is approximately equal 
to the rotation rate of the Earth. Because of the near equality 
of the two frequencies, the mean motion and the rotation rate, 
we have an example of nonlinear resonance where the effects 
of the longitude dependent asymmetries are amplified. This 
resonance has been carefully studied by L. Blitzer [ 1 ], B. Morando 
[6] and others. W. Kaula's textbook "Theory of Satellite Geodesy" 
[4] contains a clear exposition of the phenomenon. 

For simplicity, we shall ignore those terms in the potential 
which have little effect on synchronous satellites and write the 
potential as 


(4.1) 


where 


.J± 

r 

J, 


- V - V 
20 22 ’ 


V 20= P 2 (sin<p), 


V 22 = + g P 22 (sincp) cos 2(X - X 22 ), 


^ = the gravitational constant, 

r = ,the radial distance from the center of mass 
measured in Earth radii, 
cp = the latitude, 
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X = the longitude. 



P 2 (sin cp) = ~ (3 sin 2 cp-l), 

= 3(1 - sin 2 cp) 

Then, according to the standard theory, (Chapter 3 of [4 ]) if 
the mean secular rate (short periodic effects are suppressed) of 
the astromical latitude, is close to zero then the secular 
behavior of X^ is governed by the pendulum equation, i. e. , 

(4.2) X A = A (a, e, i) sin 2 (X A - X 2? ) 

where 

X , = u) + M + Q - 0 

A 

0 = yt + e , Greenwich sideral time, 
o 

A (a, e, i) = J ^ j (1 + cos i) 2 (1 - — e 2 + e 4 + . . . ). 

a 

Here the variables (a, e, i, id, 0, M) are to be interpreted as averaged 

or mean elements. It should be noted that the dominate asymmetry, 

V_ A , does not influence the secular behavior of X A . Furthermore, 
20 A * 

if 

(4. 3) X A = U) + 0 + M - 0 « 0 

and if and J are zero, then M - 0 a* 0, i. e. , we have near 
equality of the mean motion of the satellite and the rotation rate 
of the Earth. As usual 
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a = the semi -major axis, 

e = the eccentricity, 

i = the inclination, 

(D = the argument of perigee, 

G = the longitude of the ascending node, 

M = the mean anomaly. 

We shall start with the equations of motion and derive averaged 
equations corresponding to the pendulum equations (4.2) for a special 
class of orbits, namely, nearly circular orbits in the equatorial plane. 
This restriction is convenient because we want to consider the longi- 
tude dependent term as the perturbation and as the per- 

turbation parameter. In other words, the unperturbed potential is 

v = ^ + J 2 p 2< sin< e>- 

r 

For equatorial orbits, sincp = 0, and the unperturbed problem is an 
integrable central force problem. If we make the additional restriction 
that the unperturbed orbit is geometrically circular, then the algebraic 
details will not obscure our purpose, the application of the method of 
averaging to typical nonlinear resonance problems. 

With the center of mass of the Earth as the origin of an inertial 
coordinate system, we set x = rcosw, y = r sinw. Then 

1*2 2*2 

T = ^ [r + r w ], the kinetic energy, 

(4.4) V = ^T ' -- ^3 ^2 " 3^22 cos the potential energy, 

2r r 

L = T - V, the Lagrangian, 

where (compare with (4.1)) we have set sincp = 0, X ^2 “ and ^ = w - 0. 
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The equations of motion are 


_d_ 9 L_ 8L 

dt d r dr 

d_ dL_ dL_ 

dt aw " 9w 


or 


(4. 5) 





cos 2X 



-^ 6J 22 

3 

r 


sin 2 X 


We now take w as the independent variable and X, c, u, du/dw 
as dependent variables, where 


(4.6) 


2 dw 

X=w-6,c = r u 


1 du_ 
r * dw 


We verify that 


(4. 7) 


d 2 u 


dw 


dc 

dw 


+ u = 


2 + 2 2 h u 

c 

_1 du dc 
c dw dw * 


-g. 6 J, 


22 


u sin 2X , 


2 ^ J 22 u cos 2X 


dX 

dw 


= 1 - 


The equations of motion have equilibrium solutions which 
correspond to circular orbits with 24 -hour periods. They are de- 
termined by the transcendental equations obtained by setting the 
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derivatives equal to zero in (4.7), 

u = [1 + | J 2 U 2 + 9 J 22 U 2 cos 2X], 

c 

(4.8) 0 = - ^ 6u J 22 sin 2X , 



Hence, 

X — 0 , i — , i TT , 

« = ^ • 
u 

and u is the positive solution of 

u = i V 1+ f J 2 u2+ 9J 22 a2 l- 

y 

If we were only interested in studying solutions near the equilibrium 
solutions, then we would introduce normal coordinates relative to each of 
the solutions of sin 2X = 0. However, if X is to be unrestricted, it is 
more convenient to set - 0 in (4.8) and define our unperturbed orbit 

by the equations, 

(4. 9) u = - [1 + \ J 9 u 2 ] , y = c u 2 . 

' ' o 2 L 22 o J 1 oo 

c 

o 

It should be noted that this orbit has nonzero instantaneous eccentricity, 
but is geometrically circular. 

We now introduce variables p^,p 2 ,p^ which correspond to deviations 
from the resonant amplitudes of section 2. Let 
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(4. 10) 


Then, 


(4.11) 


o r 


(4.12) 


where 


Ml 

2 


= -*2 + ^2 


c 

o 


p 3 * 


Tr 2 [ p 3 + p i 


cos w + p^ sin w]. 


= 0 + JT7J0 - p. sin w + p cos w]. 
dw 22 L 1 2 


using (4. 9), we find that 


dp 3 

dw 


= '/J^B(p 1 ,p 2 ,p 3 ( X,w; J 2 . J 22 ) 


dp l dp 2 

_ cos w + __ sin w = /J 22 C( Pl , p 2 . p 3 , X, w; J 2 , J^), 


d^T sin w + d^T cos w = /J 22 D(P 1 ,P 2’ P 3 ,X ’ W; J 2* J 22 )i 


dPi 

= /jTT { C cos w - D sin w } 

d w 22 


dp 


— = /r~ { C sin w + D cos w} 
dw 22 


d P 3 

dw" ' v J 22 B ’ 
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F rom 


Jl 

2 

c 


J. 

c 

* ( 
with 
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2 

B = -C = 12 u sin 2 A. , 


o 

3 T . M- 2 \i 2. 

= 2 J 2 ( 2 U ■T'o’ 

c c 


+ 9 J 22 “2” u2 cos 

c 


1 du dc 
c dw dw 


(4.7) and (4.10), we have 


u2 " T u o = ^hz {p 3 a o + 2u o^ (p 3 + 6) 
c c 


* 'JS ^">3 * » * J 22 Pj<P3 * 61 1 


^V 2 (« 22 'T * ■'hzH'K* 'hz * 3 


sin w + p^ cos w) sin 2\ } , 


6 = p^ cos w + p^ sin w . 


+ 6 )) 


Therefore. 
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/ jTI D = J 2 { (u^ + 2 u o -Jj ) P 3 + 2 u o (Pj COS w + P 2 sin w) 

C C 

o o 

+ ( /J 22 J T + J 22 p 3 )(p 3 + P 1 ( r os w + p 2 sin w)2 ^ 

c 

o 

+ 9 /J^ + P 3 )(u o + /J^2 (P 3 + Pi cos w 


(4.14) 


+ sin w ) 2 cos 2 X + 6 + ^22^3^ U o + ^22 


(P 3 + P^ cos w + P 2 sin w )) (“Pj^ si n w + P 3 cos w ) sin 2) 


= J 2 {( u 2 + 2 u q -y ) P3 + 2 u 0 “2" * P 1 COS w + p 2 sin 


+ 9 ^22 2 U o COS 2X + 0 ^ J 2 ^22 + J 22 ^* 


After simplification, we have 


dp, 


- 5 — = - /J,, { 9 -(=■ u cos 2X sin w 
dw 22 L 2 o 

c 


+ 12 u sin 2X cos w} 

4 o J 


-J, {( U 2 + 2 u 4) 


+ 2 u q -y (pj cos w + p^ sin w) sin w} +0 (J^ /T^ + ^ 22 ^ 


(4.15) 


dp 2 r^— r q |i> 

d^T = ^ {9 ~2 U o COS 2X COS W 

c 

o 


(cont'd) 
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2 

- 12 u sin Z\ sin w} 

2 o J 

c 

o 

+ J- { (u 2 + 2 u ) p~ cos w 

2 o o 2 r 3 

c 

o 

+ 2 u (p^ cos w + p£ sin w) cos w} 

° c 

o 


+ 0 (J 2 + J 22^ 


dp 


■ 3 — = ■/ J ~ { 12 u sm 2X } +0 (J 

d w 22^ 4 o 

c 

o 


22 


Furthermore, from (4. 7), (4.9), and (4. 10), we have 

2 

c n v 

1 


dX 

dt 


(4.16) 


c u 

y _ i o q 

2 “ 1 " 2 

cu cu 


2 \i 
c - \ 


i - 1 1 + /J 22 ^ p 3 y 

2 


1+ -^ <p 3 + 6 > 


= ^2 {( ^r - 4 )p 3 + iT <pi cos w + ?2 sin w)} 


+ o (j 22 ) 


Since and 


(4. 17) — ^ = c u 2 = y + 0 ('/J^ 2 ), 


w is a fast variable, and P 1 »P 2 >P 3 » ^ are slow variables. We now 
average with respect to w and use (4.17) to obtain the first order 
(in ^2 ) averaged equations, 
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d P 


dF = - J 2 {vu o 4 p 2 } ’ 


dp 2 

dt 


J 2 { Y u o "T p l } 


(4.18) 


dp 3 

"a r 


= /J7 2 { Y i2 xi n sin 2 A } , 


2 o 


dA 

dt 


^2 ^< 3 - 


2 Mi 


3> p 3>‘ 


The qualitative effect of V^, the longitude dependent term in 
the potential, is now easy to describe. From (4.18), we have 


(4.19) 


p^ = s cos T| (w - w q ), = s sin T] (w - w q ), 


P 3 = f 1^2 Y 12 ^ u q f sin 2 A (w' ) dw 1 , 


where T\ = y u q M-/c q » s » W Q are integration constants, and 
A is a solution of the pendulum equation, 


d 2 X 


dt‘ 


= G(y , J2» ^22^ s * n ^ ^ * 


where 


? 2 
^ 2 c u 

X n £ (i / ^ O O 

G = J 22 12 Y ~4 < 2 -—3— 


and u q is the positive function of y and determined by (4.9). 
From (4.10), we have 
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j. 

r 


= — + f p 3 + P 1 COS w + p 2 sin w l' 


Therefore, the radial distance will oscillate about its mean value. 

The behavior of \ , the longitude of the satellite, is more interesting 
since it is strongly influenced by initial conditions. If X is near one 
of the unstable equilibrium values (0 or tt), or if is large enough, 

the satellite will slowly drift around the Earth. On the other hand, 
if X is near one of the stable equilibrium values ( + it or - it) and if 
is small enough, then the satellite will librate about the equili- 
brium value. The libriation period can be shown to be proportional 
to 1/7^22 J constant of proportionality is dependent on the initial 
conditions. 

Finally, we note that the pendulum equations (4.2) and (4.20) 
are compatible since 


2 

c 

o 

M- 


a (1 - e ), — = a (1 - e ) 
o' o u o' o 

o 


2 


y 


3 

a 


o 


and from (4.9) it follows that e Q = OfJ^). 

In conclusion, we note that since the pendulum equation (4.20) 
was derived by the method of averaging, the error estimates of section 
2 are applicable and we can now assert that the pendulum model is 
valid over a time interval proportional to l/ ^J^, e. g. , over a libra- 
tion period. Statements of this type must be accompanied by the phrase 
"if J ^2 is sufficiently small. " However, numerical tests (which 
will be discussed in another report) show that the theory can be used 
for synchronous satellites of the Earth. 
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